Count on me


Sometimes, the use of metrics to assess the value of scientists is
unavoidable. So let’s come up with the best measure possible.


In an ideal world, scientists applying for grants or jobs would be judged
holistically — balancing quantitative measures such as their publication
record against indications of their potential from recommendation letters,
personal interactions and other activities. So even if a candidate had not
generated many papers, it would count in their favour if the few they had
published had received positive post-publication review (comments, tweets
and blog posts, for instance). Also favourable would be a tendency to ask
insightful questions at talks that lead to valuable discussions and new
experiments, or a willingness to share reagents and expertise with their
colleagues. That would be ideal. But that is not the world in which most
scientists live.

Instead, hiring committees and grant reviewers sweat through hundreds of
applications, often with only enough time to give each submission a cursory
glance. In 2010, a Nature poll found that most administrators say that
metrics — quantifiable measures of scientists’ achievements — matter less
in job decisions than scientists often think (see Nature 465, 860-862;
2010), but good peer review is often simply not possible.

As a result, evaluators are increasingly turning to metrics, such as total
citation count and the h-index, a measure of both the quality and quantity
of papers (a scientist has an h-index of 12 if they have published 12
papers that have each received at least 12 citations). Naturally, many
scientists object to such cold quantification of their contribution. Plus,
all metrics have obvious flaws — a paper may gather many citations not
because of its importance, but because it is in a large field that
publishes frequently, so generates more opportunities for citations. Review
articles, which may not add much to the research, count the same as
original research papers, which contribute a great deal. And all existing
metrics capture only what a scientist has done, not what he or she might be
capable of. Clearly, there is a need for more and better measures.

On page 201, Daniel Acuna, Stefano Allesina and Konrad Kording suggest an
alternative: the future h-index. Unlike other metrics, this index estimates
a scientist's publication prowess five years or so into the future — a
useful timescale for tenure decisions.

Using publicly available data on the history of publication, citation and
funding for thousands of neuroscientists, researchers working on the fruit
fly Drosophila and evolutionary biologists, the authors constructed an
algorithm that converts information on a typical scientist’s CV — the
number of journals published in and articles in top journals, for instance
— into a number that represents their probable h-index in the years that
follow.

Outraged? Please send complaints to the usual address. Interested?
Calculate your own future h-index here: go.nature.com/z4rroc.

Nature receives thousands of submissions a year, some of which point out
the flaws in existing metrics and propose alternatives. We accepted the
piece by Acuna et al. after submitting it to peer review. The reviewers and
our editors felt that the authors had
used appropriate methods to obtain their algorithm, and its pre- 
dictive values seemed realistic. Furthermore, the authors are cau- 
tious about its value, pointing out that it is probably less accurate 
for scientists in other disciplines, and should not be considered a 
replacement for peer review. At the very least, the future h-index 
should help to address some problems with the current h-index, 
which tends to favour established scientists because they have had more
time to accrue citations. A forward-looking metric may give a leg up to
promising, early-career scientists who don’t yet have impressive CVs.

Nevertheless, no one wants their career potential to be reduced to a
number. Nature publishes many scientific gems that nevertheless achieve few
citations; there is no
substitute for examining the research itself to appreciate its value. 
We know that the idea of a new metric published in these pages 
will raise some anxieties, and a few hackles. But metrics are already 
being used, so it is important that they create the most accurate 
picture possible of someone’s potential. Plus, they do hold some 
advantages over peer review, by helping to eliminate the uncon- 
scious biases that can creep into personal evaluations. 

In that vein, scientists should continue to hunt for metrics that 
capture a scientist’s true value, including aspects such as teaching, 
reviewing and public-speaking ability, as well as online responses to 
publications in blogs and comments — ‘alt-metrics’. We may not live
in an ideal world, but we can still improve the recruitment, reward
and opportunities for scientists.
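
The h-index definition given earlier in this Editorial (an h-index of 12 means 12 papers that have each received at least 12 citations) can be illustrated with a minimal sketch. The function name `h_index` and the citation counts below are illustrative assumptions, not data from Acuna et al., and the sketch covers only the standard h-index, not the authors' future h-index algorithm.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, cites in enumerate(ranked, start=1):
        if cites >= rank:
            h = rank   # the top `rank` papers all have at least `rank` citations
        else:
            break      # later papers have even fewer citations, so stop here
    return h


# Hypothetical citation counts for 14 papers (illustrative only).
example = [50, 40, 33, 30, 25, 20, 18, 15, 14, 13, 12, 12, 5, 2]
print(h_index(example))  # 12: twelve papers have at least 12 citations each
```

Because the count depends only on how many papers clear a threshold, one very highly cited paper raises the index no more than a paper that just meets the threshold, which is one reason the measure rewards sustained output over time.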


Secret weapons 


US military furtiveness is hindering progress 
and the development of technology. 


In the 1940s, with the Second World War in full swing, Japanese scientists
sketched out a plan to build a microwave weapon to shoot down enemy
bombers. That idea, perhaps the earliest description of an electromagnetic
bomb, encapsulates much of what military officials still hope to achieve
with such weapons: disabling electronics (or, in some cases, people) using
a powerful energy beam, without causing any collateral physical damage. The
US military’s attempts to make a practical weapon based on this idea have
so far resulted in only one system — at least as far as it has revealed
publicly. The Air Force has built the Active Denial System, a non-lethal
high-power microwave weapon supposedly able to deter an angry mob by
creating the sensation of being burned.

For decades, the US military has conducted much of its research 
on such weapons in secret. It has often hinted that it is on the verge 
of a breakthrough, yet high-power microwave weapons are notice- 
ably absent from modern battlefields and scenes of civil unrest. The 
military, for the most part, won't discuss its progress — or lack thereof 
— citing secrecy in the name of national security. 

There is nothing unique about the classification of this research: 
nuclear weapons, stealth aircraft and satellite reconnaissance systems 
were all developed in secret. Although such furtiveness can legiti- 
mately protect US weapons and capabilities, it can also prevent much- 
needed dissemination of scientific research. And it has all too often 
concealed a lack of progress. 

As we discuss on page 198, this has been the problem with the 
programme to develop high-power microwave weapons: the little 
information that has been released points to obvious scientific and 
technological problems. Crucially, power sources for such devices 
are often too unwieldy to use. More than ten years after the Active 
Denial System was first revealed to the public, its size and complexity 
mean that it is still nearly impossible to deploy. The military rejected 
the system for use at checkpoints in Iraq because it would have taken 
16 hours to cool down the weapon’s pulse generator to superconduct- 
ing temperatures to fire it. 

Many records related to the Active Denial System remain classified 
and inaccessible to the public and the scientific community. The US 
Air Force’s unwillingness to reveal the full scope of its research into 
the biological effects of high-power microwaves in the 1990s, which 
included work on their auditory and lethal effects, flies in the face 
of the defence department's claims that it is interested in classifying 
only weapons technology, and not science. If, as the Air Force says, 
the biological research never led to weapons, then there is no reason 
not to release it. 

Work on high-power microwaves designed to take out electronics 
has not fared much better. Advocates can always claim that classified
programmes are yielding great progress, but information in the public
sphere does not paint a rosy picture. Military officials and academics
acknowledge that developing compact power sources remains the biggest
hurdle. The Air Force and a contractor have touted efforts to develop a
high-power microwave cruise missile, but neither will release details that
might allow independent experts to judge the programme's potential. The
Pentagon is staying quiet on a system developed to take out improvised
explosive devices, but what little information is available indicates that
— like the Active Denial System — it has proved too cumbersome to use
effectively.

This is not to say that all government spending on high-power microwaves is
a waste. Academic funding under Multidisciplinary University Research
Initiatives is contributing to a host
of peer-reviewed publications and collaborative research. But for the 
government to take full advantage of that research, it must be willing 
to share data and findings between military labs and academia. The 
defence department's own science board has found that reluctance 
to share is a barrier to progress. 

Getting to the truth about high-power microwaves requires trans- 
parency. Independent experts must be able to scrutinize technology 
to enable scientific–military cooperation and to provide a reality
check for those who make fantastic claims about a weapon's potential. 

By the time it cancelled the Airborne Laser programme earlier this 
year, the US defence department had poured billions of dollars into the 
weapon: a chemical laser in the nose of an aircraft, designed to shoot 
down ballistic missiles. In the end, the question was not whether the 
laser would work, but whether it would be usable, given the scientific 
and technological practicalities of integrating such a complex system. 
“There’s nobody in uniform that I know who believes that this is a 
workable concept,” concluded former US defence secretary Robert 
Gates, when he finally moved to kill the project. The same concerns 
would probably be expressed about high-power microwaves — if more 
information about them were available.


The name game 


After several years of wrangling, zoologists 
can now name new species online only. 


With access to the Internet, the (official) world of animals and
plants will soon be at your fingertips. In a landmark ruling,
zoologists last week agreed that newly identified species
can be named in online-only publications. Previously, the first official 
description of anything that crawled, flew, wriggled, walked or swam 
across Earth needed to be formally written up and recorded in print, 
where it would remain in perpetuity for future scientists to reference. 

That made sense when Henry Fairfield Osborn described Tyran- 
nosaurus rex in 1905; less so when Rob Gay named a new theropod, 
Kayentavenator elysiae, in 2010, which helps to explain why Gay broke 
with convention and claimed the first description of the species in a
self-published print-on-demand book. 

As technology blurred the distinction between what is published and 
what is not, some predicted online anarchy, with ‘taxonomic vandals’
taking to the Internet to self-publish reports of new species. An obvious
solution to the problem would have been to extend the rules from
print to cover online scientific journals, and to draw the line there. But
there were concerns about whether online journals would endure. In a
messy compromise, online journals that published descriptions of new
species printed and bound several dozen copies of the paper — in case a
twenty-second-century palaeontologist should call. In an even messier
compromise, some scientists printed papers from journal websites and 
posted them to libraries themselves. 

No more. The International Commission on Zoological Nomen- 
clature (ICZN), which sets the rules for the naming of new species, 
announced on 4 September that it was relaxing its code to encompass 
publication in online-only publications. The change, which followed a 
vote of 23 in favour to 3 against, with one abstention, comes into force 
at the start of next year. The amendment allows for descriptions of 
new species in “widely accessible electronic copies with fixed content 
and layout”. New animal species will also need to be registered with 
ZooBank.org, the official registry of the ICZN. 

It is a sensible move, and one that most in the field should welcome. 
It comes a year after the International Botanical Congress endorsed 
online-only publication for new types of plant. In an Editorial at the 
time (Nature 475, 424; 2011), which called on zoologists to follow 
suit, Nature said: “At this point, it seems that there is little reason to 
continue to demand paper on a shelf to make a species name official.” 

On hearing the news from the ICZN, one member of Nature’s staff 
quipped: “Now to name a dinosaur you don't have to behave like one.”
But that is a little unfair. Proper taxonomy and a robust archive are 
crucial to science, and the zoologists were right to consider with care 
the possible negative aspects of such a change, as well as listening to the 
clamour to embrace the new. True, the change has been a long time coming.
It is overdue, even. Still, when you have been dead and waiting for a name
since the Mesozoic era, what are a few extra years?
WORLD VIEW

Predatory publishers are corrupting open access

Journals that exploit the author-pays model damage scholarly publishing
and promote unethical behaviour by scientists, argues Jeffrey Beall.

When e-mail first became available, it was a great innovation that made
communication fast and cheap. Then came spam — and suddenly, the innovation
wasn't so great. It meant having to filter out irrelevant, deceptive and
sometimes offensive messages. It still does.

The same corruption of a great idea is now occurring with scholarly
open-access publishing.

Early experiments with open-access publishing, such as the Journal of
Medical Internet Research and BioMed Central, were very promising. Set up
more than a decade ago, they helped to inspire a social movement that has
changed academic publishing for the better, lowered costs and expanded
worldwide access to the latest research.

Then came predatory publishers, which publish counterfeit journals to
exploit the open-access model in which the author pays. These predatory
publishers are dishonest and lack transparency. They aim to dupe
researchers, especially those inexperienced in scholarly communication.
They set up websites that closely resemble those of legitimate online
publishers, and publish journals of questionable and downright low quality.
Many purport to be headquartered in the United States, United Kingdom,
Canada or Australia but really hail from Pakistan, India or Nigeria.

Some predatory publishers spam researchers, soliciting manuscripts but
failing to mention the required author fee. Later, after the paper is
accepted and published, the authors are invoiced for the fees, typically
US$1,800. Because the scientists are often asked to sign over their
copyright to the work as part of the submission process (against the spirit
of open access) they feel unable to withdraw the paper and send it
elsewhere.

I monitor predatory publishers on my blog, Scholarly Open Access, which has
become a forum in which scientists can raise their concerns over the
practice. They send me hundreds of e-mails passing on spam solicitations or
asking whether a particular publisher is legitimate.

I also get e-mails from the predators’ victims. Some have been named as
members of editorial boards without their knowledge or permission. Others
have had an article partially or completely plagiarized in a predatory
journal. Many ask me for advice on where to publish or how to withdraw an
article that they wish they hadn't submitted. As a librarian, I do my best
to answer the questions I receive, but they often require expertise in the
author’s field of study. So it is important that more scientists are made
aware of the problem.

The predatory publishers and journals often have lofty titles that make
them seem legitimate in a list of publications on a CV. Scholarly
publishing’s traditional role of vetting the best research is disappearing.
Now there is a journal willing to accept almost every article, as long as
the author is willing to pay the fee. Authors, rather than libraries, are
the customers of open-access publishers, so a powerful incentive to
maintain quality has been removed.

NATURE.COM
Discuss this article online at: go.nature.com/hnuuly

Perhaps nowhere are these abuses more acute than in India, where 
new predatory publishers or journals emerge each week. They are 
appearing because of the market need — hundreds of thousands of 
scientists in India and its neighbouring countries need to get published 
to earn tenure and promotion. 

Here, the problem is not just with the publishers. Scientists them- 
selves are also to blame. Many are taking unethical shortcuts and pay- 
ing for the publication of plagiarized or self-plagiarized work. 

Honest scientists stand to lose the most in this unethical quagmire. 
When a researcher's work is published alongside 
articles that are plagiarized, that report on con- 
clusions gained from unsound methodologies 
or that contain altered photographic figures, 
it becomes tainted by association. Unethical 
scientists gaming the system are earning tenure 
and promotion at the expense of the honest. 

The competition for author fees among fraudu- 
lent publishers is a serious threat to the future of 
science communication. To compete in a crowded 
market, legitimate open-access publishers are 
being forced to promise shorter submission-to- 
publication times; this weakens the peer-review 
process, which takes time to do properly. 

To tackle the problem, scholars must resist 
the temptation to publish quickly and easily. 
The research community needs to use scholarly 
social networks such as Connotea and Mendeley 
to identify and share information on publishers that deceive, lack 
transparency or otherwise fail to follow industry standards. Scientific 
literacy must include the ability to recognize publishing fraud, and 
libraries must remove predatory publishers from their online cata- 
logues. The worst offenders can usually be discovered without too 
much effort: their websites are littered with grammatical errors and 
they list bogus contact details. The borderline cases are more difficult 
to spot — here, we need open-access zealots to open their eyes to the 
growing quality problems. 

Conventional scholarly publishers have had an important role in 
validating research, yet too often advocates of open access seem to 
overlook the importance of validation in online publishing. They 
promote access at the expense of quality: a shortcoming that tacitly 
condones the publication of unworthy scientific research.


Jeffrey Beall is Scholarly Initiatives Librarian at the University of 
Colorado Denver. 
e-mail: jeffrey.beall@ucdenver.edu
RESEARCH HIGHLIGHTS

Selections from the scientific literature


On the origin of
Ia supernovae


Using a computer model, 
astronomers in Japan present 
evidence to support one 
theory about the origin of 
type Ia supernovae, which 
are of predictable luminosity 
and are therefore useful for 
estimating distances to remote 
galaxies. These supernovae 
are generated when a compact 
star called a carbon-oxygen 
white dwarf siphons off a 
critical amount of matter from 
a companion star. However, 
astronomers have yet to 
directly observe a companion 
star. 

Izumi Hachisu at the 
University of Tokyo and 
his team offer a solution 
to this conundrum. Their 
model, which accounts for 
the rotation of white dwarfs, 
shows that many companion 
stars have evolved to a state too 
faint to be seen. The model is 
consistent with observations 
of supernova 2011fe.
Astrophys. J. 756, L4 (2012) 


Hydrogel makes buildings sweat

Just as mammals can reduce their body temperature by sweating, so a
coating of heat-sensitive hydrogel could ‘sweat’ to cool buildings.

Wendelin Stark and his colleagues at the Swiss Federal Institute of
Technology in Zurich produced a 3-millimetre-thick layer of
thermoresponsive hydrogel. When heated to roughly 32°C, the gel undergoes a
phase transition from a wet state to a dry state, and releases water. With
this water could go much of a building’s heat.

Miniature model houses (pictured) with roofs coated with the substance were
up to 20°C cooler than uncoated models when exposed to simulated tropical
midday Sun. The authors estimate that this translates into annual savings
of 220 kilowatt-hours of energy for a detached house. A brief ‘rain’
recharged the hydrogel with water.

Adv. Mater. http://dx.doi.org/10.1002/adma.201202574 (2012)


ANIMAL BEHAVIOUR

Seals see glowing prey

Female southern elephant seals are thought to locate their food in deep,
dark waters by detecting the bioluminescence of their prey. The seals see
best at the wavelength of light produced by bioluminescent organisms.

Jade Vacquié-Garcia at the Chizé Centre for Biological Studies in
Villiers-en-Bois, part of the French National Centre for Scientific
Research, and her team tested the idea by studying four female southern
elephant seals (Mirounga leonina; pictured) living on the Kerguelen islands
in the southern Indian Ocean. The researchers used satellite tracking to
follow the animals as they made a total of 3,386 dives, and a light sensor
to measure bioluminescence. During the dives, the seals’ foraging intensity
was positively related to the number of bioluminescence events.

PLoS ONE 7, e43565 (2012)


MOLECULAR BIOLOGY

Red-blood-cell regulator

Human genome-wide association studies have indicated that a common single
nucleotide sequence variant influences the size and number of red blood
cells. A team led by Harvey Lodish of the Whitehead Institute, and Eric
Lander of the Broad Institute, both in Cambridge, Massachusetts, found that
this variant in the non-coding sequence reduces the expression of the
nearby gene CCND3, which is involved in controlling the cell cycle.
Reducing the expression of the gene in human and mouse red-blood-cell
precursors caused the cells to go through fewer divisions, resulting in
fewer, but bigger, red blood cells.

Genes Dev. http://dx.doi.org/10.1101/gad.197020.112 (2012)


Populations 
cooperate 


Populations of naturally co-occurring bacteria work together, an analysis
suggests. To test whether bacteria aggregate for reasons beyond access to
resources such as food, Martin Polz at the Massachusetts Institute of
Technology in Cambridge and his team created a network map of interactions
between 185 strains of Vibrionaceae bacteria. They found that antagonistic
interactions were much less likely between strains that are as closely
related as those within naturally occurring populations.

The authors suggest 
that the populations act 
as cohesive units that can 
defend themselves against 
other groups of bacteria. The 
coordination seems to have 
arisen through horizontal gene 
transfer, in which population 
members capable of producing 
antibiotics passed on 
antibiotic-resistance genes to 
their neighbours. 
Science 337, 1228-1231 (2012) 


Ecosystems 
compete 


Bacterial species seem to 
compete more than they 
cooperate. 

Kevin Foster at the 
University of Oxford, UK, 
and Thomas Bell at Imperial 
College London tested the 
theory that different bacterial 
species can form cooperative
‘superorganisms’. The team
measured the productivity
— in terms of CO2 given off
through respiration — of
a range of multi-species
microbial cultures. After both 
one week and one month, 
species in the mixtures were 
less productive than they 
would have been if grown 
separately, suggesting that 
social living comes at a cost. 
Curr. Biol. http://dx.doi. 
org/10.1016/j.cub.2012.08.005 
(2012) 


A treatable form
of autism


A rare form of autism that is 
accompanied by epilepsy and 
intellectual disability could 
be treatable with dietary 
supplements. 

Joseph Gleeson and Gaia
Novarino at the University
of California, San Diego, and
their team identified three 
affected families that carry 
mutated versions of a gene 
called BCKDK. The gene 
encodes an enzyme that slows 
the breakdown of branched- 
chain amino acids such as 
leucine and valine. Mice that 
lack both copies of the gene had 
low serum and brain levels of 
these amino acids, and showed 
neurological abnormalities 
and stunted growth. Enriching 
the animals’ diets in branched- 
chain amino acids reversed 
these deficits. 

Similar dietary enrichment 
normalized serum levels of 
branched-chain amino acids 
in people affected by the 
BCKDK mutation. 

Science http://dx.doi. 
org/10.1126/science.1224631 
(2012) 


CONSERVATION BIOLOGY 


Tigers and people 
can coexist 


As the world becomes more 
crowded, setting aside time, 
as well as space, for other 
species could help humans to 
live harmoniously alongside 
wildlife. 

Jianguo Liu of Michigan 
State University in East 
Lansing and his team came up 
with the idea after analysing 
footage from motion-activated 
cameras they had set up in 
and around Nepal’s Chitwan 
National Park. The team 
detected high levels of both 
human and tiger activity in 
the park, and found that the 
geographical activity patterns 
of the two species overlapped: 
any given camera was likely 
to collect images of both. 

Yet there was little conflict 
between humans and tigers 
in the park, probably because 
80% of tiger detections 
occurred at night, when 
humans are generally at home. 
Outside the park, where 
humans are even thicker
on the ground, 95% of tiger
detections were at night.
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1210490109 (2012) 
COMMUNITY CHOICE

Calm Sun promotes chilly winters

Unusually cold central European winters — such as those of the past two
years — could be due to atmospheric circulation anomalies caused by weak
solar activity.

Frank Sirocko of the Johannes Gutenberg University of Mainz, Germany, and
his team compared historical records of freezing of the river Rhine with a
time series of observed sunspot activity. The researchers found that 10 of
the 14 major freezing events that have occurred since 1780 took place in or
around years with minimal numbers of sunspots, when solar activity was
weak.

The effects of weak solar activity may propagate to the lowest layer of
Earth’s atmosphere, leading to changes in pressure systems over the North
Atlantic. These changes would favour the sustained flow of chilly Arctic
air to central Europe during winter months, the team suggests.

Geophys. Res. Lett. http://dx.doi.org/10.1029/2012GL052412 (2012)


PLANT BIOTECHNOLOGY 


Engineered plants 
can use phosphite 


Many crops depend on 
phosphorus-based fertilizers 
and are threatened by 
herbicide-resistant weeds. 
Genetically modified plants 
that can digest an alternative 
phosphorus source that 
normally inhibits plant growth 
could solve both problems. 
Damar López-Arredondo
and Luis Herrera-Estrella
at the National Polytechnic
Institute's Centre for Research
and Advanced Studies in
Irapuato, Mexico, engineered
the model plant Arabidopsis,
and tobacco plants, to
metabolize phosphite, in
addition to the orthophosphate
found in standard fertilizer.


When phosphite was 
available, the transgenic 
plants needed 30-50% less 
phosphorus to generate the 
same amount of biomass as 
that produced in the presence 
of orthophosphate. The 
researchers also tested the 
transgenic plants against 
weeds. In the presence of 
orthophosphate, weeds 
dominated the transgenic 
plants, but with the addition of 
phosphite, transgenic tobacco 
plants easily outcompeted 
weeds (pictured). 

Nature Biotechnol. http://dx.doi. 
org/10.1038/nbt.2346 (2012) 
POLICY

Missile defence
The US missile-defence programme should end costly efforts to catch
missiles in the early stages of launch and strengthen its ability to
intercept them in mid-course, says Making Sense of Ballistic Missile
Defence, a study from the National Academy of Sciences published on
11 September. Research into expensive space-based radars should be slashed,
and efforts to distinguish warheads from dummies intensified, it says.


Climate-model plan
US climate modellers should work towards better ways of developing,
interpreting and comparing climate and weather models, the National
Research Council concludes in a report released on 7 September. A National
Strategy for Advancing Climate Modeling recommends that the community try
to standardize software platforms so that they are ready for the next
generation of supercomputers. See go.nature.com/b2ymhi for more.


Nuclear future
The United Arab Emirates has started to build its first nuclear reactor,
making it the first country to embark on a commercial nuclear programme
since China in 1985. On 30 August, reactor unit 1 at Barakah was reported
by the International Atomic Energy Agency in Vienna as being officially
“under construction”. The reactor is the first of four that will be
supplied by a South Korean consortium at a total cost of US$20 billion. The
1,400-megawatt pressurized water reactor is scheduled to begin operations
in 2017.


Shale-gas risks
Drilling for shale gas might contaminate water resources, damage
biodiversity and pollute the air — and such environmental risks are not
adequately mitigated by existing legislation, says a study published by the
European Commission on 7 September. A second study notes that European
shale gas would generate more greenhouse-gas emissions than conventional
gas — but fewer than would be produced by importing gas into the European
Union.


Oil exploration
On 9 September, Shell began oil exploration in the Chukchi Sea off Alaska,
the first offshore drilling in the sea for more than 20 years. The drilling
is designed to reach an oil reservoir at a depth of 2,400 metres, but is
unlikely to be finished this year: Shell's current permit is valid only
until 24 September because of the threat of encroaching sea ice. Shell has
requested an extension, arguing that sea ice is likely to arrive later this
year. See go.nature.com/kjdtih for more.


JOURNALISM AWARD
Gayathri Vaidyanathan, who worked at Nature during a fellowship from the
International Development Research Centre in Ottawa, Canada, has won the
Evert Clark/Seth Payne Award for young journalists. The award cited ‘The
wheat stalker’ (Nature 474, 563-565; 2011) and ‘The cultured chimpanzees’
(Nature 476, 266-269; 2011), and two pieces published elsewhere.


Yellow river turns red
A stretch of the Yangtze River near Chongqing in southwest China turned red
on 6 September. What caused the sudden change in the river's colour is
still unclear, but freshwater scientists contacted by Nature say that it is
unlikely to be attributable to an algal bloom because such algae do not
grow in oxygen-rich, fast-flowing river waters. Last December, the Jian
River in northern China turned red as a result of dye pollution.


RESEARCH
New species online
Animal species can now be described and named in
electronic-only publications, the International Commission
on Zoological Nomenclature announced on 4 September.
Previously, descriptions of new animal species had
to be published in print to be considered bona fide.
Botanists endorsed electronic publication last year (see
go.nature.com/ufq7sv). See page 178 and
go.nature.com/pta%6e for more.


Drilling depth 

The deep-sea drilling vessel Chikyu has recovered rock
samples from more than 2.1 kilometres beneath the
sea floor, the Japan Agency for Marine-Earth Science and
Technology announced on 6 September. Chikyu is drilling
off the coast of Japan north of the fault zone responsible for
the 2011 Tohoku earthquake. Although oil wells frequently
reach deeper, this is the first scientific expedition to
retrieve core samples from such depths. See
go.nature.com/vxrerg for more.


Dengue trial 

A dengue vaccine protected 
Thai schoolchildren from 
some forms of the mosquito- 
borne virus, reports a study 
published on 10 September 
(A. Sabchareon et al. Lancet 
http://doi.org/jbq; 2012). 
Overall, 2.8% of 2,669 
children who received the 
vaccine developed dengue, 
compared with 4.4% of the 
1,333 in a control group, a 
result that was not statistically 
significant. But a secondary 
analysis found that the vaccine 
was 60-90% effective against 
three out of four genetic forms 
of dengue. 


PEOPLE
Telescope chief
The world’s largest radio-telescope project has chosen
a director-general. On 5 September, the board of
the Square Kilometre Array appointed Phil Diamond to
oversee construction of the €1.5-billion (US$1.9-billion)
project, which will have sites in Australia and South Africa
(see Nature 485, 555-556; 2012). Diamond (pictured)
is currently serving as the head of astronomy and
space science at Australia’s Commonwealth Scientific
and Industrial Research Organisation in Marsfield.
He replaces Michiel van Haarlem, who has served as
interim director-general since December 2011.


TREND WATCH
The world’s network of protected
areas is growing rapidly, but not
fast enough to meet the 2020 Aichi
biodiversity targets, according to
the trend detailed in the United
Nations’ Protected Planet report,
released on 7 September at the
IUCN World Conservation
Congress in Jeju, South Korea.
The proportion of area managed
exclusively by governments has
fallen, as conservation schemes
based on local community
management (often by indigenous
peoples) play a growing part. See
go.nature.com/dgve5x for more.


Hauser disciplined 


Marc Hauser, a former 
psychologist at Harvard 
University in Cambridge, 
Massachusetts, engaged in 
research misconduct, the US 
Office of Research Integrity 
concluded on 5 September. 
He neither admits nor denies 
the charges. See page 189 for more.


Football donation 
The US National Football League (NFL) is donating
US$30 million to the Foundation for the National
Institutes of Health to support research into brain injuries.
The money will help to establish a Sports and Health
Research Program at the National Institutes of Health,
to fund research on chronic traumatic encephalopathy
and other sport-related brain injuries. Meanwhile, a study
in Neurology (E. J. Lehman et al. Neurology
http://doi.org/h97; 2012) has reported
that professional American football players are more likely
than the general population to have Alzheimer’s disease or
the neurodegenerative disease amyotrophic lateral sclerosis
as an underlying cause of death.


AWARDS 


Balzan prizes 

Kurt Lambeck, a geophysicist at the Australian National
University in Canberra, has been awarded one of this
year’s two 750,000-Swiss-franc (US$790,000) Balzan prizes
in science for his research on sea-level changes and glacial
cycles. Plant scientist David Baulcombe of the University of
Cambridge, UK, won the other for his work on epigenetics.


Lasker award
The US$250,000 Albert Lasker Basic Medical Research Award
has been awarded to biologist Michael Sheetz of Columbia
University in New York city, biochemist James Spudich of
the Stanford University School of Medicine in California and
cell biologist Ronald Vale of the University of California,
San Francisco, for their work on cytoskeletal motor
proteins, which underlie cellular transport and muscle
contraction. Winners of the award often go on to receive a
Nobel prize.


PROTECTED AREAS GROW, SLOWLY
The extent of the world’s protected areas has increased 48% since
1990, but more are needed to reach 2020 biodiversity targets.


SEVEN DAYS | THIS WEEK


14 SEPTEMBER
More details on the workings of the
US National Institutes of Health’s youngest
centre, the National Center for Advancing
Translational Sciences, are revealed at two key
meetings in Bethesda, Maryland. The centre's
advisory council and its Cures Acceleration
Network Review Board both convene.
go.nature.com/zablws


17-21 SEPTEMBER
Nuclear safety and security are discussed
at the annual general conference of the
International Atomic Energy Agency in
Vienna.
go.nature.com/dfaznk


Dawn departure 


NASA’s mission to explore the asteroid belt’s two most
massive bodies departed from its first target, Vesta,
on 5 September. The Dawn probe had been mapping
Vesta since July 2011. It is now heading for an early-2015
appointment with Ceres, the dwarf planet that is thought to
harbour substantial amounts of subterranean ice. See
go.nature.com/pgoftk for more.
Larger areas of open ocean and warming surface waters will alter evaporation, cloudiness, precipitation and storm patterns across the Arctic and beyond.


Ice loss shifts Arctic cycles 


Record shrinkage confounds models and portends atmospheric and ecological change. 


BY QUIRIN SCHIERMEIER 


Before indifferent satellite eyes, the top
of the world is undergoing a transformation.
The Arctic ice pack, a primary
indicator of climate change, has shrunk in 
recent weeks to an extent that no computer 
model and few scientists had thought possible. 
After five years that all saw less ice than 
previously documented in the 34-year satel- 
lite record, this year’s record loss has scien- 
tists questioning their models. They are also 
striving to understand the complex cascade of 
effects — from shifting weather patterns to dis- 
placed marine species — that the accelerating 
retreat could trigger. 
The US National Snow and Ice Data Center
(NSIDC) in Boulder, Colorado, announced the
record decline on 26 August, saying that the 
ice extent had dropped to 4.10 million square 
kilometres (see ‘Going, going..’). The figure is 
70,000 square kilometres less than the previ- 
ous record low, set in 2007, and it came at least 
two weeks before the annual low is typically 
reached. According to the NSIDC, by 9 Sep- 
tember that figure had dropped by another 
14%, to around 3.52 million square kilometres. 
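
The figures in this paragraph can be cross-checked with simple arithmetic. The short Python sketch below uses only the numbers quoted by the NSIDC in the text (4.10 and 3.52 million square kilometres, and the 70,000-square-kilometre margin below the 2007 record); the function and variable names are illustrative assumptions, not part of any NSIDC tool.

```python
# Figures quoted in the text, in millions of square kilometres.
EXTENT_26_AUG = 4.10   # extent announced by the NSIDC on 26 August
EXTENT_9_SEP = 3.52    # approximate extent reported for 9 September
RECORD_MARGIN = 0.07   # 70,000 square kilometres below the 2007 record


def percent_drop(before: float, after: float) -> float:
    """Return the fall from `before` to `after` as a percentage of `before`."""
    return (before - after) / before * 100.0


# Decline between the two dates, relative to the 26 August extent.
drop_since_26_aug = percent_drop(EXTENT_26_AUG, EXTENT_9_SEP)

# The 2007 record implied by "70,000 square kilometres less than".
implied_2007_record = EXTENT_26_AUG + RECORD_MARGIN

print(f"Drop between 26 August and 9 September: {drop_since_26_aug:.1f}%")
print(f"Implied 2007 record low: {implied_2007_record:.2f} million km^2")
```

The computed drop rounds to the 14% given in the text, and the implied 2007 record is simply the 26 August extent plus the quoted margin.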

The massive melt has occurred in relatively
normal weather conditions, with only one
strong summer storm to hasten the break-up
of the pack ice. Mark Serreze, director of the
NSIDC, says that much of the Arctic pack
is now thin first-year ice, frozen only last
winter, which requires much less energy
to break apart and disperse than multi-year
ice. “We have entered a new regime,” he says.
“The sea ice is in such poor health in spring
that large parts of it can’t survive the summer
melt season, even without boosts from extreme
weather.”

NATURE.COM
For more on the receding Arctic ice, see:
go.nature.com/sSw2pr

Computer models that simulate how the ice 
will respond to a warming climate project that 
the Arctic will be seasonally ‘ice free’ (defini- 
tions of this vary) some time between 2040 
and the end of the century. But the observed 
downward trend in sea-ice cover suggests that 
summer sea ice could disappear completely 
as early as 2030, something that none of
the models used for the next report by the
Intergovernmental Panel on Climate Change
comes close to forecasting.

“There’s a tremendous spread
between observations and model
projections,” says Serreze. “It might
be that natural variability is larger
than assumed, or perhaps models
don’t get the change in ice thickness
right.” Uncertainty also
remains over the strength of
various natural ‘feedbacks’. For
example, an exposed ocean is
darker than an ice-covered surface
and so absorbs more solar
heat, causing yet more warming
and melting.

A lack of fine detail about cir- 
culation patterns in the Arctic 
Ocean could also be throwing off the 
models. For example, a survey carried 
out in 2008 revealed 20 formerly unob- 
served eddies, each some 15 to 20 kilome- 
tres in diameter, in waters north of Canada. 
“Whether these are new features, and what
role they might play for ocean-mixing processes,
we don’t know yet,” says Yves Gratton,
an oceanographer and Arctic researcher at
the National Institute of Scientific Research in
Montreal, Canada.

Ice loss could also accelerate if the ice pack’s 
underlying waters warm up. Unlike in most 
of the world’s oceans, the coldest water in the 
Arctic, at −1°C to −2°C, is at the surface; below
a depth of 200-300 metres, saltier and warmer 
water of about 1°C flows in from the Atlantic. 
The cold surface layer — called the halocline 
— isolates the sea ice from the warmer water 
below. 

But the halocline is vulnerable to warming 
from above, says Henning Bauch, a marine 
geologist at the GEOMAR research centre in 
Kiel, Germany. A thinning halocline — some- 
thing that has not yet been observed — would 
not only jeopardize the sea ice but could also 
melt the carbon-rich permafrost beneath
shallow coastal waters, releasing greenhouse gases
into the atmosphere. 

Meanwhile, Arctic biology is already chang- 
ing, as the retreat and thinning of sea ice allows 
more sunlight to penetrate the upper ocean 
and deprives certain species of habitat, says 
Jørgen Berge, a marine biologist at the University
of Tromsø in Norway. For example, the
dominant Arctic zooplankton, the copepods
Calanus hyperboreus and C. glacialis, are
being replaced by Atlantic C. finmarchicus.
Meanwhile, Arctic cod (Arctogadus glacialis) is
increasingly being out-competed by its larger
Atlantic cousin Gadus morhua.

Other species are showing unexpected resil- 
ience. In January, a research cruise into the 
pack ice north of Svalbard, Norway, found evi- 
dence that Apherusa glacialis — amphipods, or 
small crustaceans, that feed on algae growing 
beneath the ice and are the main food source 
for many Arctic sea birds — are less dependent 
on extensive sea ice than previously thought. 
Just as some coastal organisms drift with the 
tide to retain their positions in river estuaries, 
these amphipods can sink to depths at which 
pole-ward ocean currents sweep them farther 
north, where they can find and colonize
surviving sea ice.

“This looks like an adaptive trait that allows 
key Arctic-ice fauna to persist in changing 
sea-ice environments,” says Berge, who led 
the study. He points out that the Arctic Ocean 
has been ice-free during the summer several 
times during the past 2.5 million years, and 
that as recently as 8,000-6,500 years ago, during
a period known as the Holocene Thermal
Maximum, much more of the Arctic was ice-free
than it is today, with no mass-extinction
events known to have occurred.
Farther afield, this year’s record sea- 
ice melt might foreshadow a harsh 
winter in parts of Europe and 
North America. Recent research, 
although preliminary, suggests a 
connection between late-sum- 
mer Arctic sea-ice extent and 
the location of areas of high 
and low atmospheric pressure 
over the northern Atlantic. 
The highs and lows can remain 
relatively fixed for weeks, 
shaping storm tracks and seasonal
weather patterns such as
extended cold surges.
Ralf Jaiser, a climate scientist at 
the Alfred Wegener Institute for Polar 
and Marine Research in Potsdam, Germany,
found a significant correlation in
1989-2011 meteorological data between
late-summer Arctic sea-ice extent and
atmospheric-pressure anomalies that favour extreme
weather such as prolonged cold snaps in
winter. He reasons that in autumn, the open
Arctic Ocean sheds heat to the high-latitude 
atmosphere. The warming tends to reduce the 
large-scale atmospheric-pressure gradient and 
weakens the dominant westerly winds in the 
Northern Hemisphere. Those winds normally 
sweep warm, moist Atlantic air to western 
Europe; their weakening leaves the region 
more prone to persistent cold. 

“The impacts will become more apparent 
in autumn, once the freeze-up is under way 
and we see how circulation patterns have influ- 
enced the geographical distribution of sea ice,” 
says Judith Curry, a climate researcher at the 
Georgia Institute of Technology in Atlanta. 
But, she adds, “We can probably expect somewhere
in the mid/high latitudes of the Northern
Hemisphere to have a snowy and cold
winter.”
The Asian tiger mosquito (Aedes albopictus) can transmit at least 22 different viruses.


DISEASE CONTROL 


Europe on alert
for flying invaders 


Spread of disease-carrying mosquitoes prompts 
guidelines for boosting surveillance. 


BY DECLAN BUTLER 


The buzz of tiny wings has been growing
ever louder in Europe over the past
decade, as invasive mosquitoes carrying
exotic diseases such as chikungunya and
dengue fever have taken up residence around 
the continent. Now Europe's entomologists 
and public-health experts are joining forces to 
defend the region against the growing threat. 
On 28 August, the European Centre for 
Disease Prevention and Control (ECDC) in 
Stockholm released guidelines that should 
help European countries to develop mos- 
quito surveillance, which had become mostly 
unnecessary after malaria was largely eradi- 
cated from the continent in the mid-twentieth 
century. But the need for better vigilance has 
been growing again since the late 1990s, as 
the increased globalization of trade gives spe- 
cies such as the Asian tiger mosquito (Aedes 
albopictus) more opportunities to penetrate 
into Europe. 
The surveillance data will feed into the
three-year-old VBORNET consortium, a
Europe-wide network of some 400 medical
entomologists, microbiologists and public-health
experts that is funded by the ECDC. The
wealth of data they are assembling should help
researchers to better understand how invasive
mosquitoes are spreading and the risks they
pose, and so to develop control strategies.
The latest guidelines provide an A-Z of 
mosquito surveillance, covering topics rang- 
ing from what sort of mosquito traps should 
be used in particular circumstances to resource 
management and policy-making advice. 
Francis Schaffner, head of integrated pest 
management at the consultancy Avia-GIS in 
Zoersel, Belgium, and one of the lead authors 
of the guidelines, hopes that they will encour- 
age scientists, local authorities and govern- 
ments to carry out more comprehensive 
surveillance. By recommending standardized
surveillance reporting, the guidelines should
also allow data generated by different researchers
in different countries to be seamlessly combined
into pan-European databases, he says.

A key target of surveillance is the Asian 
tiger mosquito, which can transmit at least 
22 different viruses, including chikungunya 
and dengue. After it was reported in Albania 
in 1979, this native of southeast Asia spread to 
Italy in 1990 and has now expanded along the 
Mediterranean coast to nations such as France 
and Spain, as well as around the Black Sea (see 
‘Multiplying mosquitoes’) (J. M. Medlock et al.
Vector-Borne Zoonotic Dis. 12, 435-447; 2012). 
At least five other invasive mosquitoes capable 
of spreading diseases are also gaining ground 
in Europe, including Aedes aegypti, which can 
transmit dengue and other diseases. 

The international tyre business brings 
most of the unwanted arrivals. Mosquitoes 
can breed in small volumes of water, such as 
rainwater in tyres, and invasive forms have 
drought-resistant eggs that enable stowaways 
to survive long-distance transport before 
being revived by rain once they reach their 
destinations. The invaders can also hitch a 
ride on imported plants, such as lucky bamboo 
(Dracaena species). In the future, climate 
change could accelerate the spread of these sub- 
tropical and tropical species: predictive models 
suggest that invasive mosquitoes could find 
suitable habitats in northern Europe by 2030. 

In 2007, Italy saw an outbreak of several 
hundred cases of chikungunya, which causes 
nausea, vomiting and debilitating joint pain. 
Dengue fever, which causes similar symp- 
toms and can be fatal, hit a handful of people 
in France and Croatia in 2010. And there has 
been a steady rise in cases of West Nile virus, 
the fever that is currently notching up a record 
year for infections in the United States. 

Surveillance remains patchy or non-existent 
in many of the most affected countries around 
Europe, including Portugal, Greece, Turkey 
and those bordering the Black Sea. Monitor- 
ing insects is expensive and time-consuming, 
and governments that are preoccupied with 
existing health challenges may be reluctant to 
put funds towards tracking potential future 
threats. But one underlying message of the 
report is that early surveillance is ultimately 
cheaper than dealing with an established pest, 
which can require widespread spraying with
insecticides.

In the Netherlands, for example, Asian tiger
mosquitoes are introduced every year, but
surveillance and control
measures have so far prevented them from
becoming established, says Marieta Braks, 
an entomologist at the National Institute 
for Public Health and the Environment in 
Bilthoven, and leader of VBORNET’s strate- 
gic consultation group. The Dutch authorities 
pay close attention to companies that import 
tyres from countries harbouring dangerous 
mosquitoes, and that stock tyres outdoors, 
where rain could allow mosquito eggs
to hatch. In countries such as Italy, where 
the tiger mosquito is already widespread, 
surveillance is instead geared more towards 
keeping numbers under control. 

VBORNET’s geographical information sys- 
tem allows scientists to combine surveillance 
data with environmental factors, including 
temperature and humidity, to better track the 
main invasive mosquito species. Models based 
on these combined data can reveal, for exam- 
ple, how environmental conditions can affect 
mosquitoes’ ability to transmit pathogens, and 
so provide information on the likelihood of 
outbreaks in various regions. 

The maps may also perform a useful peer- 
pressure function, says Braks, by highlighting 
where researchers and authorities have failed 
to supply and share data. “What we hope is that
no one will want to be a blank on the map.”


MULTIPLYING MOSQUITOES

Data from June 2012 show that the Asian tiger mosquito
(Aedes albopictus) has spread rapidly across southern Europe
over the past decade.
EU battles over research billions


Politicians will spend the next few months negotiating cuts to Horizon 2020’s budget. 


BY NATASHA GILBERT 


Tens of billions of euros are at stake as
negotiations ramp up to shape Europe's
next seven-year research programme.
The discussions will cover familiar divisions
over applied versus basic research and
conflicting national agendas, but the continent's
ongoing financial problems will add an extra
measure of anxiety.

The European Commission proposed 
the general size and shape of the Horizon 
2020 research and innovation programme in 
November 2011 (see Nature http://doi.org/dtvtbx;
2011). European Union (EU) member
states and members of the European Parliament
will negotiate and agree on the budget and final
details of the programme, which will run from
2014 to 2020, over the next 15 months.

Last year, the commission requested
€77.6 billion (US$99 billion) for Horizon 2020,
out of a total European budget for 2014-20 of
about €1 trillion (at constant 2011 prices). But
many members of the European Parliament
(MEPs) want research to get €100 billion,
twice as much as was allocated to Horizon
2020’s predecessor, the Seventh Framework
Programme (FP7), which runs from 2007 until
2013 (see ‘Budget breakdown’). Most scientists
and EU science officials say that such a
large jump is unrealistic, and there are growing
fears that even the lower budget proposed
by the commission is unlikely, given Europe's
shaky economy.


BUDGET BREAKDOWN

The proposed budget for Europe’s Horizon 2020 research-funding programme, the successor to its
Seventh Framework Programme, would boost the European Research Council (ERC), the Marie Curie
Actions career-development programme and the European Institute of Innovation and Technology (EIT).

[Figure. Horizon 2020 (2014-20), €77.6 bn*: Excellent science (amount illegible in source), including ERC €13.3 bn and Marie Curie €5.8 bn; Industrial leadership €17.7 bn; Societal challenges €30.7 bn; EIT €2.8 bn; Other €2 bn. Seventh Framework Programme (2007-13), €50.5 bn†: ERC €7.5 bn; Marie Curie €4.8 bn; EIT n/a.]

* In constant 2011 prices
† In constant 2006 prices
At a meeting on 30 August, high-level
ministers said that cuts to the proposed EU
budget are “inevitable”, implying that spending
on Horizon 2020 will be eroded. Ministers
are aiming to reach a consensus on the size 
and shape of the EU budget by the end of the 
year, but in the meantime, MEPs, scientists and 
member-state officials will fight over which 
of Horizon 2020's ambitions should bear the 
brunt of any budget reductions. The first sortie 
will come on 17-18 September, when MEPs on 
the European Parliament’s Industry, Research
and Energy (ITRE) committee will begin dis- 
cussing around 3,000 suggested amendments 
to the Horizon 2020 budget. 

Most likely to feel the pinch is the European 
Institute of Innovation and Technology (EIT), 
a virtual institute headquartered in Budapest 
and made up of scientific ‘communities’ that
conduct research on themes such as climate 
change and sustainable energy. The institute 
also runs postgraduate education programmes 
and knowledge-transfer activities to dissemi- 
nate the fruits of research to businesses and 
policy-makers (see Nature 440, 8-9; 2006). 
The EIT concept was developed and cham- 
pioned by the European Commission, which 
now suggests that the institute's budget should 
win €2.8 billion from Horizon 2020, 3.6% of 
the total research budget and vastly more than 
the €309 million in European funds allocated 
to it between 2008 and 2013. 

However, few outside the commission want 
the EIT to claim so much; some argue that the 
institute has yet to prove its scientific worth.
Teresa Madurell, a Spanish social-democrat 
MEP on the ITRE committee, suggests that 
the EIT should get no more than 3.1% of 
Horizon 2020's funds. Paul Rübig, an Austrian
Christian Democrat MEP who also
sits on the committee, wants to trim that 
even further, suggesting that the EIT’s remit 
be reduced to knowledge-transfer activities 
only. 

The EIT’s critics say that the money 
would be better spent on initiatives with 
a proven track record. The most popu- 
lar programmes include the European 
Research Council (ERC), which dispenses 
excellence-based grants for frontier science, 
and the Marie Curie Actions programme, 
which provides career-development grants 
to young researchers. The commission pro- 
posed that the ERC’s budget should rise by 
77% to €13.3 billion, but Luke Georghiou, 
who studies European research policy at 
the University of Manchester, UK, expects 
opposition from member states in eastern 
Europe, which could hold up the budget 
negotiations. Scientists in those coun- 
tries often lose out to those from research 
powerhouses such as the United Kingdom 
and Germany when ERC grants are allo- 
cated. Some say that the ERC should also 
fund those scientists with the ‘potential
for excellence’. Vicky Ford, a British Conservative
member of the ITRE committee,
counters that ERC funding should go only 
to excellent research. 
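
The percentages quoted for the EIT and the ERC follow directly from the budget figures given in the text and in the ‘Budget breakdown’ graphic. As a rough check, the Python sketch below recomputes them; the function and constant names are illustrative assumptions, and the comparison ignores the fact that the Horizon 2020 and FP7 figures are stated in different constant-price years.

```python
# Budget figures quoted in the article, in billions of euros.
HORIZON_2020_TOTAL = 77.6   # commission's request for Horizon 2020
EIT_PROPOSED = 2.8          # proposed EIT allocation from Horizon 2020
EIT_2008_13 = 0.309         # 309 million euros allocated for 2008-13
ERC_FP7 = 7.5               # ERC budget under FP7 (from the graphic)
ERC_PROPOSED = 13.3         # proposed ERC budget under Horizon 2020


def share_percent(part: float, total: float) -> float:
    """Return `part` as a percentage of `total`."""
    return part / total * 100.0


def rise_percent(old: float, new: float) -> float:
    """Return the increase from `old` to `new` as a percentage of `old`."""
    return (new - old) / old * 100.0


eit_share = share_percent(EIT_PROPOSED, HORIZON_2020_TOTAL)
erc_rise = rise_percent(ERC_FP7, ERC_PROPOSED)
eit_multiple = EIT_PROPOSED / EIT_2008_13

print(f"EIT share of Horizon 2020: {eit_share:.1f}%")
print(f"Proposed rise in ERC budget: {erc_rise:.0f}%")
print(f"EIT proposal relative to 2008-13 funds: {eit_multiple:.1f} times")
```

On these inputs the EIT share and the ERC rise round to the 3.6% and 77% stated in the text, and the EIT proposal works out at roughly nine times its 2008-13 allocation.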

The ERC tends to fund individual 
researchers, so during FP7 larger collabo- 
rations have relied on separate funding 
streams. But some scientists are concerned 
that funding for applied research has 
squeezed out support for basic science 
in these programmes. Over the past four 
years, calls for collaborative proposals in 
areas of basic science such as epigenetics 
and protein regulation have become less 
frequent, says Karin Metzlaff, executive 
director of the European Plant Science 
Organisation in Brussels. She hopes that 
the ITRE committee meeting can help 
to make basic-science projects a bigger 
priority in Horizon 2020’s collaborative- 
research programme, dubbed Societal 
Challenges. 

Overall, research is likely to be spared the 
EU budget’s most severe cuts, because poli- 
ticians recognize that investing in science 
is central to boosting economic growth. 
Indeed, countries such as Spain, which 
has slashed domestic science spending 
owing to the financial crisis, will become 
more reliant on Horizon 2020 funds. That 
may even prompt them to give ground in 
negotiations on contentious issues such as 
human embryonic stem-cell research, says 
Georghiou. “They need to replace their
national funding shortage,” he says.


IN FOCUS | NEWS 


Marc Hauser left Harvard University in 2011 after the institution found him guilty of misconduct. 


RESEARCH ETHICS 


Misconduct ruling 
is silent on intent 


Psychologist Marc Hauser admits errors but not fraud. 


BY EUGENIE SAMUEL REICH 


Mavens of research ethics often
insist that there is a clear difference
between sloppy science and
scientific fraud. But if ever there was a case
that blurs that line, it is that of Marc Hauser, 
a high-flying evolutionary psychologist who 
resigned from Harvard University in Cam- 
bridge, Massachusetts, in 2011, after the uni- 
versity found him guilty of misconduct. 

In a finding published on 6 September in 
the Federal Register, the US Office of Research 
Integrity (ORI) in Rockville, Maryland, which 
oversees science funded by the National Insti- 
tutes of Health (NIH), has also concluded that 
Hauser “engaged in research misconduct”. The 
office found that he: “published fabricated data” 
in the journal Cognition in 2002 (ref. 1); “falsified”
results in an unpublished study; and made false 
statements in four other instances. But the ORI 
did not state whether Hauser acted intention- 
ally — its definition of misconduct also covers 
cases in which misrepresentations are “reckless”. 

That has provided enough wiggle room for 
both supporters and critics to claim the finding 
as a vindication. “It isn’t a strong case for inten- 
tional misconduct. It seems it’s peccadilloes,” 
says Pierre Pica, a linguist at the French National
Center for Scientific Research in Paris, who has
criticized the investigation. But Gerry Altmann, 
a psychologist at the University of York, UK, and 
editor-in-chief of Cognition, disagrees. “The 
number of instances [in the investigated body 
of work] suggests it is extremely unlikely it was 
not intentional,” he says. 

Harvard had found Hauser guilty of eight 
counts of misconduct in 2010, after a three- 
year internal investigation of some of his stud- 
ies of cognition in non-human primates. To the 
annoyance of many scientists, the university did 
not release its findings, so exactly what Hauser 
had done remained unclear. The ORI provides 
details on six counts of misconduct, three of 
which were published.

In a statement sent to Nature, Hauser 
acknowledges mistakes and errors, but not 
scientific fraud or research misconduct, which 
the ORI says he neither admits nor denies. 
“Although I have fundamental differences with 
some of the findings in the ORI report,” Hauser
says, “I acknowledge that I made mistakes. I let 
important details get away from my control, and 
as head of the lab, I take responsibility for all 
errors made within the lab, whether or not I was 
directly involved.” No one else who worked in 
Hauser’s lab has been charged with misconduct. 

The ORI often resolves allegations
against scientists through a settlement
agreement, which helps the reputedly over- 
stretched office to save on legal resources. 
Hauser’s settlement means that he escapes 
the harshest possible punishment — a ban 
on future NIH funding — and must instead 
submit a supervisory plan that will ensure 
integrity if he applies to the agency for any 
future research grants. (The agency spent 
at least US$790,000 on grants that funded 
work affected by the misconduct.) 

Bennett Galef, an expert in animal 
behaviour at McMaster University in 
Hamilton, Canada, reviewed evidence 
used in the Harvard investigation at the 
request of Hauser’s lawyer, and questions 
the ORI’s use of the word “fabricated” to 
describe the errors in the Cognition paper (ref. 1).
“That is a conceivable read of what happened,
but it’s slanted,” he says. According
to Galef, lab records show that in the study, 
which examined habituation to sound 
patterns, two groups of tamarin monkeys 
were played the same stimuli instead of 
different ones, as the paper reported. Galef 
says that this was because of an error in the 
computer program that played the stimuli, 
which the authors plausibly didn’t know 
about because the protocol called for them 
to be blinded to the stimuli at the time they 
coded the monkeys’ responses. “It was a 
big mistake. It was definitely a disaster, 
but whether there was any fabrication, you 
don't know,” says Galef. 

ORI officials familiar with Hauser’s case 
could not comment because of govern- 
ment privacy rules. The inspector-gen- 
eral of the National Science Foundation, 
which also funded Hauser, is still looking 
into one of the misconduct counts found 
by Harvard; the remaining instance was 
referred to the ORI but has been deemed 
outside its statute of limitations. Harvard 
says that the graduate students and post- 
docs who were working in Hauser’s lab 
have been relocated to other labs within 
the university. 

Hauser has now begun a different 
chapter in his life, working with at-risk 
youth. His website states that he has co- 
founded a company called Gamience, 
which develops computer games to teach 
self-control and other cognitive skills, and 
is working with a non-profit company that 
serves schools on Cape Cod, called the 
Cape Cod Collaborative, in Bourne, Mas- 
sachusetts. “This work is deeply satisfying 
and I look forward to making new contri- 
butions to human welfare, education and 
the role of scientific knowledge in understanding
human nature,” Hauser says.

1. Hauser, M. D., Weiss, D. & Marcus, G. Cognition
2. Hauser, M. D., Glynn, D. & Wood, J. Proc. R.

The Dark Energy Survey camera will investigate millions of galaxies for the subtle effects of weak lensing.


COSMOLOGY 


Cameras to focus 
on dark energy 


A pair of detectors that measure minute distortions in images 
of distant galaxies will probe the riddle of cosmic acceleration. 


BY ERIC HAND 


Even the best pictures of a distant galaxy are
a bit lopsided. But this is an attribute, not
a bug. Because mass distorts space-time,
light coming from distant galaxies is bent as it
passes through intervening shoals of invisible
matter, leaving the images of these distant
objects minutely sheared and stretched.

Two astronomical surveys now scheduled
to come online seek to take advantage of this
effect, which is known as weak gravitational
lensing. The surveys aim to use the technique
to get a firmer handle on dark energy, the mys-
terious force that is apparently speeding up the
expansion of the Universe. By observing the
patterns of distortions across large swathes of 
sky (see ‘Falling into line’), astronomers hope to 
map the density and distribution of dark mat- 
ter, the web-like invisible scaffolding around 
which visible matter is thought to have first 
coalesced. Then, by looking at changes in this 
hidden web across cosmic time, they hope to 
discern the imprint of dark energy. 
Observers already study the effects of dark
energy by tracking cosmic landmarks: the
standard candles of distant supernovae and the
standard rulers of ripples in the distribution
of galaxies. But those techniques reveal only 
how the Universe's expansion rate has changed 
because of dark energy. Weak lensing offers an 
extra prize: the distortions hold clues to the 
action of gravity at far-off locales. The tech- 
nique can thus address an increasingly press- 
ing question for theorists: what if the Universe's 
accelerated expansion is not the result of dark 
energy’s mysterious anti-gravitational force, but 
is instead a sign that gravity behaves differently 
in different parts of the Universe? 

“It provides unfettered insight into the prop-
erties of gravity that we can't get by other tech-
niques,” says Rachel Bean, an astrophysicist at
Cornell University in Ithaca, New York. “Its 
potential is massive, but so are the challenges.” 

Unlike strong gravitational lensing, in which 
a major concentration of mass markedly distorts 
the image of a single galaxy, the subtle effects of
weak lensing can be detected only in detailed 
surveys of millions of galaxies. One of the new 
efforts will use the Hyper Suprime-Cam (HSC), 
which achieved first light on 28 August on the 
8.2-metre Subaru telescope in Hawaii. By 2018, 
it aims to have imaged 10 million galaxies over 
a 1,500-square-degree swathe of the sky, says
Satoshi Miyazaki of the National Astronomi- 
cal Observatory of Japan in Mitaka, who is the 
principal investigator of the HSC survey. 

A rival project, the Dark Energy Survey 
(DES), is set to start operating later this month 
on the 4-metre Blanco telescope in Chile, says 
its principal investigator, Josh Frieman of the 
Fermi National Accelerator Laboratory in 
Batavia, Illinois. Being the main project for the 
Blanco telescope, the DES will get more observ- 
ing nights than the HSC, allowing it to image 
300 million galaxies over 5,000 square degrees 
by 2018. But its smaller telescope means that it 
will not be able to peer as deeply into the distant 
Universe as the HSC survey. 

Even with these massive digital cameras — 
870 million pixels for the HSC and 570 mil- 
lion pixels for the DES — weak lensing “is a 
very, very difficult measurement to make”, says
Bean. To detect the subtle distortions caused
by lensing, for example, astronomers must
subtract aberrations caused by the optics of the 
telescope and by Earth’s atmosphere. 

Doing the measurements from space could 
help. The European Space Agency is planning 
to launch a dark-energy probe called Euclid 
in 2019. And in the United States, an as-yet- 
unfunded dark-energy mission called the 
Wide-Field Infrared Survey Telescope, which 
would dedicate much of its time to weak- 
lensing measurements, was ranked as the top 
priority for space-based astronomy in a 2010 
decadal survey. 

Some astronomers doubt that the technique 
can outperform supernova surveys and meas- 
urements of baryonic acoustic oscillations — 
ripples in cosmic structure dating from several 
hundred thousand years after the Big Bang. “I 
think the jury’s still out,” says David Schlegel, 
principal investigator of the Baryon […] National Laboratory
in Berkeley, California. He points out, for
instance, that a key assumption in weak lens- 
ing — that galaxies are oriented randomly — is 
turning out not to be true, forcing astronomers 
to correct for a systematic effect. 

But others say that it is important to take 
diverse approaches to the dark-energy mystery. 
Chris Stubbs, an astrophysicist at Harvard Uni- 
versity in Cambridge, Massachusetts, says that, 
a decade ago, astronomers were intent on simply 
proving that the effect was real. Now there is 
little doubt, and they need techniques such as 
weak lensing to explore more-detailed ques- 
tions — for instance, whether the effect varies 
with distance, direction or the density of matter, 
and whether the effects of dark energy can be 
distinguished from those of variable gravity. 
“We are only now entering an era where we're 
attacking the dark-energy problem with instru- 
ments and experiments that were explicitly 
optimized for that purpose,” says Stubbs.

When light from a randomly oriented group of galaxies passes through a region influenced by the gravity 
of a large concentration of dark matter, the galaxy images show a slight alignment along a particular
direction. Such distortions can reveal the changing concentrations of matter at different cosmic times.

Soaring sales of antibiotics at Indian pharmacies are compounding drug-resistance problems.


PUBLIC HEALTH 


India moves to tackle 
antibiotic resistance 


Drug regulator aims to restrict over-the-counter sales. 


BY ERICA WESTLY IN CHENNAI, INDIA 


Kumar Prabhash’s voice is sombre as he
describes how he is losing a key weapon
in the fight against hospital infections. 
At the Tata Memorial Centre in Mumbai, 
where the oncologist works, at least half of 
bacterial samples from patients with infec- 
tions are resistant to carbapenems — a class 
of ‘second-line’ antibiotics used to treat infec-
tions that are already resistant to other drugs.
Just a few years ago, the resistance rate in such 
samples was only 30%. “That is really scary for 
us,” says Prabhash. 

Antibiotic resistance is a problem world- 
wide, but is particularly worrying in India, 
where hospital standards are inconsistent 
and antibiotics are readily available over the 
counter at pharmacies. Prabhash’s story was 
all too familiar to the physicians who heard it 
at India’s first joint medical-society meeting 
on antibiotic resistance, held on 24 August in 
Chennai. The symposium is part of an ongoing 
campaign by Indian clinicians for a national 
policy to curb misuse of these vital drugs. 

“As physicians, we have a responsibility for 
the future of India,” said J. A. Jayalal, who rep-
resented the Indian Medical Association at the
meeting. “If we're not raising our voices about
this, future generations will not forgive us.” 
Last week, physicians moved a step closer 
to their goal when India’s drug regulators 
announced a plan that would put tight restric- 
tions on the sale of antibiotics. Carbapenems 
and many other antibiotics are already on a 
list of 536 drugs in India that require a pre- 
scription. But studies have shown that such 
drugs are easy to purchase at retail pharmacies
without a physician’s signature (U. P. Rathnaker
et al. J. Clin. Diagn. Res. 6, 951-954; 2012 and
A. Kotwani et al. J. Clin. Pharm. Ther. 37, 308-312; 2012).
Under the new regulations, car-
bapenems and other antibiotics with declining
efficacy would be treated as a special category
of prescription drugs. They would carry prom-
inent red labels, and surprise inspections by
the regulator would check whether pharmacies
were selling them over the counter.

ANTIBIOTICS FOR ALL

Carbapenem antibiotics are increasingly available
without prescription in India, compared with
Western nations. This is driving the emergence of
microbial resistance to the drugs in the country.

Drug misuse is a key factor that leads to 
antibiotic resistance, says Arjun Srinivasan, a 
medical epidemiologist at the US Centers for 
Disease Control and Prevention in Atlanta, 
Georgia. A patient's risk of developing a life- 
threatening infection that is resistant to car- 
bapenems is 15 times greater if he or she has 
been exposed to the drug class before, he notes. 
“Improved antibiotic use will be essential to 
combat drug resistance,” he adds. 

There is ample evidence of excessive 
carbapenem use in India. A survey (go.nature.com/rpsg2r)
published earlier this year by the
Center for Disease Dynamics, Economics and 
Policy, a non-profit health-research organiza- 
tion headquartered in Washington DC, found 
that Indian pharmacy sales of carbapenems 
increased by nearly six times between 2005 
and 2010 (see ‘Antibiotics for all’). 

Limited access to basic medical care is one 
reason for the overuse of carbapenems and 
other second-line antibiotics. About 70% of 
India’s 1.2 billion citizens live in rural areas, 
where, despite government efforts, hospitals 
are often understaffed and lacking basic equip- 
ment and medication. Rather than relying on 
physicians, many rural patients turn to local 
pharmacists and whatever drugs they have in 
stock. 

In 2010, India’s government created a task 
force to develop a national antibiotic policy. 
But last year, Indian health minister Ghulam 
Nabi Azad rejected the task force’s recom- 
mendations to ban over-the-counter sales of 
antibiotics and prohibit the use of carbapen- 
ems and other second-line antibiotics except 
in major hospitals. Azad argued that the bans 
could have a devastating effect on health in 
rural areas. 

The health ministry did not respond to 
Nature’s questions about the latest recommen- 
dation from India’s drug regulator, but a health- 
ministry representative told the Times of India 
that the office did intend to approve the anti- 
biotic restrictions, possibly as early as this week. 

Infectious-disease specialist Abdul Ghafur, 
who organized the joint society meeting in 
Chennai, hopes to use momentum from the 
latest restrictions to push for a national anti- 
biotic policy to regulate India’s 24,000 hospi- 
tals and hundreds of thousands of physicians, 
as well as its pharmacies. “I am now trying to 
follow up with the health minister’s office to 
arrange a meeting with the various medical 
societies,” he says. “There are still many more
steps ahead.”

DIVE MASTER


The US flagship submersible Alvin is getting 
a partial upgrade. But deep-sea exploration 
faces some rough water. 


BY RICHARD MONASTERSKY 


At an industrial hangar in Woods Hole, Massachusetts,
a plush toy dog stands guard next to a 5-tonne titanium
sphere. Polished to a high shine, the 2.2-metre orb reflects
the lights overhead and the faces of passing scientists, who
come to ogle the US$11-million engineering marvel — a container 
strong enough to ferry researchers 6,500 metres down to the bottom 
of the ocean. 

The sphere will be the new heart of the Alvin submersible, the United 
States’ workhorse in the deep sea. Over its five-decade career, Alvin has 
dived to the sea floor more than any other research submersible and has 
made some of the most important oceanographic discoveries of the past 
few generations. But in the past few years, it has been showing its age,
and other countries have surged past with vehicles that can venture far
deeper than Alvin’s current limit of 4,500 metres. In 2010, Alvin’s care-
takers at the Woods Hole Oceanographic Institution (WHOI) took the
venerable submersible out of commission for a $40-million redesign. 

The massive upgrade was a long time coming. “I started here in 1996 
and there was talk about building a new sub then,” says Bruce Strickrott,
an Alvin pilot and expedition leader at WHOI, who is managing the 
sub’s reconstruction. 

This month, Strickrott and his crew will start to rebuild the sub 
around the passenger sphere, readying it for a series of final tests off the 
coast of Bermuda by March. Assuming it passes, Alvin will resume a full 
schedule of scientific dives, joining a suite of tools that are giving US 
researchers unprecedented ways to study the deep sea — from manned 
subs like Alvin to remotely controlled robots and autonomous mappers. 

But after celebrating the revamped sub’s maiden voyage, researchers 
will wake up to some sobering realities. Budget problems have con- 
strained the current Alvin upgrade, so researchers must wait for the 
completion of a second construction phase, in 3-5 years, before they 
can take advantage of the greater depth limit allowed by the new sphere. 
That leaves the United States well behind France, Russia, Japan and now 
China, which sent its research submersible below 7,000 metres in June. 
And as wealthy entrepreneurs such as film director James Cameron 
capture public attention with dives in privately built subs, the United 
States is making painful cuts to many of its deep-sea assets. The admin- 
istration of President Barack Obama is propos- 
ing to cut off funding next year for a pair of 
manned submersibles in Hawaii that can dive 
to 2,000 metres. 

“America has one submersible capa-
ble of reaching Titanic-ish depths — in
the couple-of-miles range — which these days isn’t that deep,” says
Cameron. “What is the future of deep submergence research?”

The Alvin submersible was taken out of service for upgrades in 2010 and will dive again next year.

That future once looked boundless. On 17 February 1977, Alvin 
skimmed along the ocean bottom about 400 kilometres northeast of 
the Galapagos Islands. Days before, sensors towed by a ship had identi- 
fied a spike in water temperatures near the sea floor. Alvin explored 
the region and brought researchers face to face, for the first time, with 
hydrothermal vents, fissures in the sea floor that spew streams of warm, 
shimmering brine into the near-freezing water. In what was supposed 
to be a barren desert at the ocean bottom — far from the reaches of life- 
powering sunlight — Alvin revealed oases of clams, crabs, shrimp and 
alien-like white tube worms crowned with crimson tops. 

Two years after the Galapagos dives, an Alvin expedition witnessed 
a more violent scene: jets of black fluids, superheated to 400°C, shoot-
ing out of gigantic chimneys of rock, which were dubbed black smok-
ers. Later, in the geologically quieter Gulf of Mexico, Alvin found vents
that were releasing cooler methane-rich fluids. These ‘cold seeps’ also 
support communities of animals, including slow-growing tube worms 
thought to be hundreds of years old — some of the most ancient crea- 
tures alive on the planet. The discovery of such ecosystems, powered by 
microbes that harness chemical energy from vent waters, raised ques- 
tions about where life originated on Earth and where it may reside on 
other worlds. 

Alvin's string of accomplishments silenced early critics — some of 
the top geoscientists of the 1960s and 1970s — who initially regarded 
the submersible as little more than a toy. And ever since, the vehicle has 
been in high demand, trekking to the sea floor 100-150 times a year. 
Although some countries can dive more deeply, Alvin “is the most reli-
able submersible there is”, says Susan Humphris, a geochemist at WHOI
and principal investigator of the Alvin overhaul. “We need a workhorse 
at the bottom of the ocean. That's what Alvin has been and that’s what 
Alvin will continue to be.” 


AT THE HELM 

Cindy Van Dover knows the submersible better than any other scientist. 
Now the director of the Duke University Marine Laboratory in Beau- 
fort, North Carolina, Van Dover spent nearly three years training and 
working as an Alvin pilot after getting her doctorate at Woods Hole. She 
is the only researcher, and the only woman, to have piloted the craft, 
choosing to detour from a conventional research career because the 
job offered opportunities that few researchers can get. “I wanted to do 
a lot of dives,” she says. “It was about wanting to see the environment
that I was studying.”

In 1991, when Van Dover gave up her pilot’s seat and went back to 
being an occasional passenger in Alvin, one of the things she missed 
most was the view. The old sphere, built in 1973, had three 13-centi- 
metre-wide viewports, one on each side for passengers and one at the 
front for the pilot. Through those saucer-sized side ports, Van Dover 
says, “sometimes it was a great view but often it wasn't, so you’d end up
watching a video of what the pilot was doing”. The new sphere retains 
the side windows but has three 18-centimetre-wide ones at the front, 
giving researchers a field of view that overlaps with the pilot’s (see ‘Voy-
ager to the bottom of the sea’).

The space inside the sphere has also grown, which should make for 
less-painful rides. With an internal diameter of less than 2 metres, the 
old sphere had no room for seats; scientists had to huddle on the floor, 
and lean awkwardly against the sloped walls. After a 2-hour descent to 
the bottom, limbs would stiffen up. “You switch your position because 
your legs are so uncomfortable and you rock the whole submersible,” 
says Chris German, a marine geochemist at WHOI and chief scien- 
tist for the US National Deep Submergence Facility in Woods Hole, 
which coordinates the use of Alvin. “That can compromise taking 
the samples.” 

The new sphere has 18% more interior space and will have ergonomi- 
cally designed seats. And while sampling or exploring, researchers will be 
able to take advantage of brighter lights and improved camera systems.
Scientists are keen to use the tools to explore the geology and biology
of the mid-ocean ridges, the 60,000-kilometre-long chain of volcanic 
fissures that forge new ocean crust. So far, oceanographers have sur- 
veyed only a tiny fraction of the ridge system. The ridges and their flanks 
hide a subsurface plumbing network of fissures and pores so extensive 
that it circulates a volume of water equal to the entire ocean about every 
million years or so, altering the chemistry of the seas. Future dives will 
also target the communities of microbes living beneath the sea floor, 
which have been estimated to constitute as much as one-third of the
planet’s biomass. And marine scientists want to study the vast abyssal
plains, which cover more than half of Earth’s surface and are poorly explored;
most of the species that researchers pull up in samples from abyssal sites are new
to science.

But some places that scientists want to visit lie deeper than 4,500 metres
and so will remain off-limits to Alvin for now. The National Science Founda-
tion, which is funding most of the Alvin redesign, could not afford to build a
6,500-metre-class sub all at once. In particular, such deep dives require more
energy and it would be prohibitively
expensive to build a submersible that uses current battery technology
to dive frequently and to greater depths.

Rich Lutz, an oceanographer at Rutgers University in New 
Brunswick, New Jersey, who has racked up more Alvin dives than per- 
haps any scientist, was surprised to learn that the current overhaul 
won’t add to the sub’s depth range. “I certainly would have assumed
that one of those improvements would be the capability of going down 
to 6,500 metres,” he says. 

BOTTOM DWELLER 

Yet depth isn’t everything. Despite the excitement over record-breaking 
dives by China’s Jiaolong (7,020 metres) and Cameron’s Deepsea 
Challenger (11,000 metres), researchers don't go deep very often. 
Fewer than 10% of the dives by the French and Japanese submersibles 
approach their limits. Vincent Rigaud, manager of the underwater sys- 
tems department at the French Research Institute for Exploration of the 
Sea in La Seyne-sur-Mer, says that research is increasingly focusing on 
shallower depths. “The science today is linked to [minerals] exploita- 
tion or areas where there is an impact from man’s activities or man’s 
interests,” he says. 

Some US researchers are more upset about other limitations of the 
new Alvin. “It's a superb tool for what it does but it could have been a hell
of a lot better,” says Bruce Robison, a marine ecologist at the Monterey
Bay Aquarium Research Institute in Moss Landing, California. In 2004,
Robison was a member of a committee of the National Research Council
that assessed the future needs of deep submergence science. One of his 
biggest wishes was for a human-occupied vehicle (HOV) that could 
adjust its buoyancy to stop at any depth, allowing researchers to work 
well above the ocean floor. Most of the biological and chemical pro-
cesses in the ocean take place in the water column, not on the sea floor,
Robison says. Yet neither the old Alvin nor the rebuilt sub can adjust 
its buoyancy enough to linger above the sea floor without draining its 
batteries. So nearly all trips go straight to the bottom. 

Along with concerns over Alvin’s limitations, US researchers also 
lament the loss of several scientific submersibles. Funding woes forced 
Florida International University’s Harbor Branch Oceanographic Insti- 
tute in Fort Pierce to retire its two submersibles, the Johnson Sea Links, 
last year. And the National Oceanic and Atmospheric Administration 
plans to stop funding the National Undersea Research Program, which 
supports two Pisces submersibles located in Hawaii. The two HOVs
are known for their work exploring seamounts in the Pacific Ocean,
including the newest Hawaiian volcano. Robison bemoans the loss of
all the other US scientific submersibles. “As good as Alvin is,” he says,
“that isn’t enough.”

VOYAGER TO THE BOTTOM OF THE SEA

A US$40-million upgrade to Alvin will lead to noticeable improvements for researchers, but the submersible
will have the same depth limit, 4,500 metres, as the previous one. Only five nations — and entrepreneur
James Cameron — have research submersibles that can go deeper than 4,000 metres.

[Chart axis: dive depth (m)]

Thrusters: Improvements include an added front thruster to enhance manoeuvrability.
Batteries: Current lead-acid batteries will need to be replaced for Alvin to dive deeper.
Personnel sphere: An 18% increase in its size means more room for scientists.
Viewports: Two extra, larger windows give scientists better views.
Cameras: Alvin has been outfitted with brighter lights and high-definition cameras.
Carrying capacity has been doubled.

Veteran explorer Robert Ballard is not shedding any tears over the 
HOVs. “I consider that a dead paradigm,” says the marine geologist 
who discovered the Titanic and co-led the cruise that discovered the 
first hydrothermal vents; he used Alvin to explore them both. He now 
relies on autonomous underwater vehicles (AUVs) and remotely oper- 
ated vehicles (ROVs), which can be controlled by researchers on a ship 
or from halfway across the world. “AUVs augmented with ROVs is 
where I see the future. I don't need any people in them.” 

Alvin is limited to 4-5 hours of work at depth, whereas ROVs can 
spend days on the bottom and some AUVs can explore for months. 
And unmanned vehicles don’t require such large support ships and 
crews as HOVs, which can trim the cost of a mission substantially. 
Thanks to their data links, ROVs also enable essentially unlimited 
numbers of researchers around the world to take part in missions. 

The ROV Jason 2 has become a favourite tool of US marine geolo- 
gists, who love its flexibility in exploring and collecting samples down 
to 6,500 metres. But its heavy steel cable, which supplies power and 
transmits data, prevents the ROV from diving deeper and complicates 
missions. 

Just up the hill from where Alvin is taking shape at Woods Hole, 
researchers are working on an experimental vehicle that will sever 
that troublesome umbilical cord while retaining the advantages of 
an ROV. Canary yellow and the size of a delivery van, Nereus made 
headlines in 2009 by reaching the bottom of Challenger Deep in the 
South Pacific — at 11,000 metres, the lowest spot on Earth. Nereus can 
operate either autonomously or connected to a surface ship through 
a fibre-optic cable. 

But the tether, just three times the thickness of 
a human hair, breaks easily. So the Woods Hole 
team is now working to cut the physical link with 
the surface. This month, off Guam, the crew will 
test a laser system on Nereus that sends the data
to a receiver hanging down from a ship. Researchers will control Nereus
using acoustic signals from the ship.

196 | NATURE | VOL 489 | 13 SEPTEMBER 2012 | CORRECTED

Despite the advantages of ROVs, most scientists say that there is 
no substitute for exploring the depths in person. “There is still a place 
for having a human brain and eyes at the bottom of the ocean,” says 
Humphris. To those who say there is no need for HOVs, she says: “It’s
a little bit like saying, ‘Why would you go to the Grand Canyon rather
than watching a movie about it?’”

Alvin is also one of the best recruiting tools in marine science. “It’s 
what has brought the very best minds to the deep-sea oceanographic 
community,” says Lutz. “You're able to tell them, ‘You come work with
us and we're going to take you a mile and a half down to the bottom
of the sea in a submersible,’ as opposed to saying, ‘We're going to take
you out on a ship and you’re going to watch it on a television screen.’”

Private money could help to preserve some opportunities for deep- 
sea science. Eric Schmidt, former chief executive of Google, purchased 
an 83-metre-long ship in 2010 that he retrofitted for oceanographic 
science. His Schmidt Ocean Institute will start research cruises with the 
vessel next year and will test existing ROVs with the aim of purchasing 
one in a couple of years and making it available for researchers.

And Cameron's dive aboard his one-person Deepsea Challenger fired 
up the imagination of researchers such as Lutz. “You look at it as a 
scientist and say, ‘Shit! I'd go down in that thing.’”

Cameron says that his sub is available for anyone willing to put up the 
money for further expeditions. “I’m more than happy to put in the sub 
and the technology. It’s not doing any good sitting in my barn,” he says.

In the meantime, the hopes of the US oceanographic community ride 
on Alvin, which currently rests in pieces at Woods Hole. Looking up at 
the gleaming passenger sphere, Strickrott takes a moment to picture 
the day his crew has completed all its work and Alvin is ready to dive 
once again. “When it gets rolled out here, it’s spotless,” he says. “It’s a 
beautiful thing.”


Richard Monastersky is a features editor for Nature.

E-BLAST

The simplest kind of electromagnetic-pulse
source works by blowing itself up.

[Diagram labels: detonator; explosives in metal casing; coil; magnetic field; antenna; electromagnetic pulse]

1. A current is set flowing around a cylindrical
coil of wire, creating a strong magnetic field
along the length of the source.
2. The detonation starts at one end,
compressing the field as it forces the metal
casing against the coil and short-circuits it.
3. The detonation rips along the
midline, squeezing the compressed
field out as an electromagnetic pulse.


WASTED ENERGY


DESPITE 30 YEARS OF RESEARCH ON HIGH-POWER MICROWAVES,
THE US MILITARY HAS YET TO PRODUCE A USABLE WEAPON.


BY SHARON WEINBERGER


For some Pentagon officials, the
demonstration in October 2007 must
have seemed like a dream come true —
an opportunity to blast reporters with
a beam of energy that causes searing pain.
The event in Quantico, Virginia, was to be 
a rare public showing for the US Air Force's 
Active Denial System: a prototype non-lethal 
crowd-control weapon that emits a beam of 
microwaves at 95 gigahertz. Radiation at that 
frequency penetrates less than half a milli- 
metre into the skin, so the beam was supposed 
to deliver an intense burning sensation to any- 
one in its path, forcing them to move away, but 
without, in theory, causing permanent damage. 
However, the day of the test was cold and
rainy. The water droplets in the air did what
moisture always does: they absorbed the 
microwaves. And when some of the reporters 
volunteered to expose themselves to the atten- 
uated beam, they found that on such a raw day, 
the warmth was very pleasant. 

A demonstration of the system on a sunny 
day this March proved more successful. But 
that hasn't changed a fundamental reality for 
the Pentagon's only acknowledged, fully devel- 
oped high-power microwave (HPM) weapon: 
no one seems to want it. Although the Active 
Denial System works (mostly) as advertised, its 
massive size, energy consumption and techni- 
cal complexity make it effectively unusable on 
the battlefield.

The story is much the same in other areas of
HPM weapons development, which began as 
an East-West technology race nearly 50 years 
ago. In the United States, where spending on 
electromagnetic weapons is down from cold- 
war levels, but remains at some US$47 million 
per year, progress is elusive. “There's lots of
smoke and mirrors,” says Peter Zimmerman,
an emeritus nuclear physicist at King’s Col-
lege London and former chief scientist of the
US Arms Control and Disarmament Agency
in Washington DC. Although future research
may yield scientific progress, he adds, “I cannot
see they will build a useful, deployable weapon”.

For many critics, the US HPM programme 
has become a study in wishful thinking, exac- 
erbated by a culture of secrecy that makes real 
progress even more difficult. 

The quest to build an electromagnetic 
weapon — an e-bomb, in military jargon — 
was sparked on 8 July 1962, when the United 
States carried out Starfish Prime, the largest 
high-altitude nuclear test that had ever been 
attempted. The 1.4-megaton thermonuclear 
warhead, detonated 400 kilometres above 
the central Pacific Ocean at 9 seconds past 
11 p.m., Hawaii time, blasted huge swarms of 
charged particles outwards along Earth’s mag- 
netic field. Their gyrations generated a pulse 
of microwave energy that drove measuring 
instruments off the scale. Artificial auroras lit 
up the night across swathes of ocean. And in 
Honolulu, more than 1,300 kilometres from 
the detonation point, the pulse set off burglar 
alarms, knocked out street lights and tripped 
power-line circuit breakers. 

Nothing like Starfish Prime has been seen 
since August 1963, when the Partial Test Ban 
Treaty outlawed nuclear explosions anywhere 
but underground. But the test showcased 
the potential destructiveness of an electro- 
magnetic pulse to military planners on both 
sides of the cold-war divide, and launched 
them into a race to harness it as a weapon using 
a non-nuclear source.


POWER CUT 
The US Air Force has been the main funder 
of the country’s HPM programme from the 
beginning. At first, its goal was a weapon 
capable of taking out an enemy’s computers, 
communication systems and other electron- 
ics. In theory, the idea remains compelling: 
an e-bomb would be able to fire microwave 
‘bullets’ at the speed of light and, if tuned to 
the right frequencies, disable its targets with- 
out collateral damage. Cars could be stopped 
in their tracks, radars blinded and computers 
destroyed, with no need for high explosives. 
But that goal has foundered on the HPM 
weapon's main technical challenge: gener- 
ating a pulse that is directed enough to pick 
out a specific target and powerful enough to 
have an effect when it gets there, ideally using 
a generator that is small and light enough for 
an aeroplane or missile to lift. 


A battery-powered device can generate an 
HPM pulse, but producing the kind of highly 
concentrated power needed to destroy elec- 
tronics typically requires detonating a conven- 
tional explosive inside a device that destroys 
itself in the act of pulsing (see ‘E-blast’). 
Because doing this inside a piloted aircraft is 
risky — “a few pounds in the right place will
take down anything”, notes Zimmerman — the
Air Force has in recent years pursued HPM 
weapons designed for single-use missiles. 

For example, the Counter-electronics High- 
power Microwave Advanced Missile Project 
(CHAMP) is an experimental cruise missile 
designed to take out electronic targets such as 
production sites for weapons of mass destruc- 
tion. Neither the Air Force nor Boeing, its main 
contractor for CHAMP, will discuss technical 
details of the programme. But the project is just
a prototype; when CHAMP was flight-tested
last year, it still didn’t include the HPM payload.

It is possible to make a microwave genera-
tor compact enough for a missile. Engineers at
Texas Tech University in Lubbock have devel-
oped an experimental explosive-based source
less than 2 metres long and 16 centimetres in
diameter (M. A. Elsayed et al. Rev. Sci. Instrum.
83, 024705; 2012). But lead developer Andreas
Neuber points out that there are physical lim-
its: to maximize the microwave power while
keeping the system small, the engineers had
to increase the internal electrical field. The
result can be a catastrophic failure of the sys-
tem’s insulating materials that short-circuits it
before the system can build up much power.

Even if the military succeeds in packaging
an HPM system, there is serious doubt over
how effective the pulses will be when they hit
their targets. In the late 1980s, a device called
Gypsy successfully took out a bank of personal
computers during the Air Force's first unclassi-
fied test of a microwave weapon. But building
on that success “became an incredibly difficult
research project”, says Doug Beason, a physicist
who was associate director for threat reduction
at the Los Alamos National Laboratory in New
Mexico until 2008, and wrote The E-Bomb (Da
Capo, 2005), a discussion of directed-energy
weapons. “You could understand how micro-
waves affected components of electronic cir-
cuits — transistors, capacitors, inductors and
all that. But when you started putting them
together in complex circuits, it became more
of a stochastic process and you wouldn't always
get the same results each time.”

There is similar uncertainty over how
electromagnetic energy flows through enclo-
sures such as buildings. The process is chaotic,
says Edl Schamiloglu, an electrical engineer at
the University of New Mexico in Albuquerque
who is involved in a multi-university research
initiative funded by the US defence depart-
ment to improve such predictions. “When an
electromagnetic ray or wave-beam enters the
enclosure,” he says, “it will continue bouncing
around and not repeat its trajectory.”

In short, more than 20 years after the Gypsy
test, scientists still can’t reliably predict the
damage a weapon would do. And that is with-
out even considering the countermeasures
that an adversary might use, which could be as
elementary as surrounding sensitive electronics
with a Faraday cage — the equivalent of the alu-
minium mesh used to shield microwave ovens.

The effort to disable electronics has
remained mostly secret. But in 2001, the Air
Force publicly announced that it had made
substantial progress in developing microwave
weapons that target people, when it unveiled
the Active Denial System.

Development of the system began in the 
1990s with the Air Force’s efforts to explore 
the biological effects of microwaves. A project 
code-named ‘Hello’ studied how to modulate
the clicking or buzzing sounds produced by
microwave heating in the inner ear, to produce
psychologically devastating ‘voices in the head’.
‘Goodbye’ explored the use of microwaves for
crowd control. And ‘Good Night’ looked at
whether they could be used to kill people.


HELLO GOODBYE 

Only the Goodbye effect went into develop- 
ment as a weapon. Further bioeffects research 
was conducted in secrecy at Brooks Air Force 
Base near San Antonio in Texas, but even that 
programme almost stalled when the weapon 
was ready to move from animal to human 
testing. Hans Mark, a nuclear engineer at the 
University of Texas at Austin who was then the 
Pentagon’s director of defence research and 
engineering, paid a visit to Brooks in 2000 to 
check out the work. “Dr Mark didn't believe
in the effect,” recalls Beason, “and he actually
had a shouting match with one of the main 
researchers.” But Mark’s approval was needed 
to advance the project, so he agreed to be sub- 
jected to the beam. 

The Air Force got its human tests. The 
Brooks scientists joke that “you've never seen
a political appointee run so fast”, says Beason.

Mark says that his doubts about the Good- 
bye effect were rooted in what he calls the 
“extravagant claims” made by its advocates. 
If nothing else, he says, the superconduct- 
ing electromagnet that powered the system’s 
pulse generator required a cooling system 
too big and cumbersome to be used in the
field. Mark says that he allowed the system to
proceed to human testing not because he was
convinced that it would work, but because after
exposing himself to the beam, he decided that
human testing at least wouldn't harm anyone.
“Almost all of this programme has been a waste
of money,” he says.

The 1962 Starfish Prime nuclear test set off an electromagnetic pulse that sparked a weapons programme.

Mark’s concerns have proved prescient: 
efforts to deploy the weapon have been futile. 
At the 2001 unveiling, the defence department
touted the Active Denial System for use in 
peacekeeping missions in places such as Kos- 
ovo and Somalia. But after the invasion of Iraq 
in 2003, when the US Joint Non-Lethal Weap- 
ons Directorate offered to deploy the Active 
Denial System to the region, it was rebuffed. 

“We knew it wasn't reliable,” said Franz Gayl,
the Marine Corps’s science and technology
adviser, in an interview last year. Worse, he said,
the pulse generator was so big that it had to be
carried on its own utility vehicle. “That was a
recipe for disaster,” said Gayl, “because the oper-
ators are going to be a target.” And worst of all,
he said, before use the system had to be cooled 
down to 4 kelvin — a process that took 16 hours. 

The defence department tried to deploy 
the weapon in Afghanistan in 2010, but it was 
sent home unused. In the same year, Califor- 
nia rejected a smaller version meant for use in 
prisons. The device was built by defence con- 
tractor Raytheon of Waltham, Massachusetts, 
which declines to discuss it. 

Other weapons have fared little better. The 
Air Force Research Laboratory developed an 
HPM system called MAXPOWER to detonate 
roadside bombs remotely, but it was the size 
of an articulated lorry — too unwieldy to be 
deployed in Afghanistan. The Joint Improvised 
Explosive Device Defeat Organization, the 
defence department’s bomb-fighting agency,
declined to discuss the system, citing classifica-
tion issues. But it did say that, as of 2011, it was 
not funding MAXPOWER. 

In July, General Norton Schwartz, the Air 
Force chief who retired last month, warned 
that the service would have to withdraw from 
some science efforts amid budget cuts, but
that HPM technology would still be pursued.
It “clearly has potential”, he told the trade
magazine Aviation Week & Space Technology,
warning that countries such as Russia could be 
ahead of the United States. 


THE MICROWAVE GAP 

The concern that other nations, or even terror- 
ists, could be working on similar technology 
seems to have been one of the prime motiva- 
tions for the US military to continue investing 
in microwave weaponry, despite the apparent 
lack of progress. According to a 2009 briefing 
on non-lethal technologies prepared by the 
Office of Naval Research and obtained under 
the Freedom of Information Act, Russia, China 
and even Iran are pursuing HPM programmes 
— and the UK Defence Science and Technol- 
ogy Laboratory at Fort Halstead is sponsoring 
a classified car-stopping programme. 

But such programmes are not necessarily 
proof that the cold-war HPM arms race is still 
going on. At least some countries may — like 
the United States — be conducting research 
out of fear of becoming vulnerable to such 
weapons. Modern technologies such as mobile 
phones are particularly susceptible to HPMs, 
says Michael Suhrke, head of the electromagnetic
effects and threats business unit at the
Fraunhofer Institute for Technological Trend
Analysis in Euskirchen, Germany.

As for HPM weapons in the hands of 
terrorists, many scientists regard that threat 
as far-fetched at best. Even if terrorist groups 
had the sophistication to carry out the neces- 
sary testing, says Yousaf Butt, a physicist in 
the high-energy astrophysics division at the 
Harvard-Smithsonian Center for Astrophysics 
in Cambridge, Massachusetts, why would 
they? A microwave weapon of any magnitude 
would probably have to be powered by explo- 
sives. And if they had that kind of material, he
says, “why wouldn't they just explode it?”

“Is it conceivable?” asks Philip Coyle, who 
in 2010-11 served as associate director for 
national security and international affairs in 
the White House Office of Science and Tech- 
nology Policy, and is now a senior fellow at the 
Center for Arms Control and Non-Prolifera- 
tion, a think tank based in Washington DC. 
“Barely, I think. I wouldn't take it for granted 
that terrorists couldn't do it. But I'd rather 
terrorists spent all their time working on [an 
HPM weapon] than car bombs.” 

Experts still disagree on whether HPMs 
might eventually make useful weapons. But 
one thing is clear: the mythical e-bomb capable 
of stopping cars or planes has not yet material- 
ized on the battlefield. Asked whether the Air 
Force had produced any operational weapons, 
its research lab said only: “Due to operational 
concerns, we are unable to respond to this 
question.” 

The secrecy that surrounds HPM weapons 
research seems to have greatly exacerbated 
technical obstacles to the programme. In 2007, 
for example, a report on directed-energy weap- 
ons by the Defense Science Board said that the 
Pentagon had not effectively used data col- 
lected by university researchers to understand 
microwave effects. The Air Force claims that 
sharing is better now. But working in a field 
shrouded in secrecy still affects how informa- 
tion is disseminated. Neuber, for example, 
could agree to answer questions for this arti- 
cle only if he replied in writing, and only after 
his responses had been cleared through the US 
Army office that sponsors his team’s work. 

“Working in an area that is to a large extent 
of military interest requires playing by a set of 
different rules to some extent,” he wrote. “Some 
flow of information is not as free as in other 
areas of the research endeavour.” 

To John Alexander, a retired army colonel 
who once headed the non-lethal weapons 
programme at Los Alamos National Labora- 
tory, the secrecy reinforces the air of fantasy 
around the whole endeavour. “My point is 
always: chemistry and physics work the same 
way for everyone, and there are smart folks out 
there, so who are you trying to fool?” he says. 
“The people not getting adequate information 
were our own commanders.”

SEE EDITORIAL P.177


Sharon Weinberger is a freelance writer in 
Washington DC.


Predicting scientific success


Daniel E. Acuna, Stefano Allesina and Konrad P. 
Kording present a formula to estimate the future 
h-index of life scientists. 


We research scientists often worry
about the future of our careers. Is
our research an exciting path or
a dead end that will end our careers prem-
aturely? Predicting scientific trajectories is
a daily task for hiring committees, funding 
agencies and department heads who probe 
CVs searching for signs of scientific potential. 

One popular measure of success is physi-
cist Jorge Hirsch’s h-index, which captures
the quality (citations) and quantity (num-
ber) of papers, thus representing scientific
achievements better than either factor alone.
A scientist has an h-index of n if he or she has
published n articles receiving at least n cita-
tions each. Einstein, Darwin and Feynman,
for example, have impressive h-indices of 96,
63 and 53, respectively. According to Hirsch,
an h-index of 12 for a physicist — meaning
12 papers with at least 12 citations each —
could qualify him or her for tenure at a major
university.
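
The definition above translates directly into a short calculation. The following is a minimal sketch, not the authors' code; the function name `h_index` and the sample citation counts are illustrative assumptions.

```python
def h_index(citations):
    """Return the largest n such that n papers have at least n citations each.

    `citations` is a list of per-paper citation counts (hypothetical input).
    """
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            # The top `rank` papers all have at least `rank` citations.
            h = rank
        else:
            break
    return h


if __name__ == "__main__":
    # Twelve papers with at least 12 citations each give an h-index of 12.
    print(h_index([12] * 12))
    # One very highly cited paper does not raise the index on its own.
    print(h_index([25, 8, 5, 3, 3]))
```

The second call illustrates why the index rewards sustained output: a single paper with 25 citations counts no more than one with 3 once the rank threshold is passed.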

However, the h-index and similar metrics
can capture only past accomplishments, not
future achievements. Here we attempt to
predict the future h-index of scientists on the 
basis of features found in most CVs. 

We maintain that the best way of predict- 
ing a scientist’s future success is for peers to 
evaluate scientific contributions and research 
depth, but think that our methods could be 
valuable complementary tools. 

The typical research CV contains infor-
mation on the number of publications,
those in high-profile journals, the h-index
and collaborators. One can also infer inter- 
disciplinary breadth, the length and quality 
of training, the amount of funding received 
and even the standing of the scientist’s PhD 
adviser. Such factors are taken into account 
for hiring decisions, but how should they be 
weighted? Fortunately, obtaining data on the 
scientific activities of individual researchers 
has never been easier. Using all of these fea- 
tures, we can begin to probe the scientific 
enterprise statistically. 


VITAL STATISTICS 
To construct a formula to predict future 
h-index, we assembled a large data set and 
analysed it using machine-learning tech-
niques. Our initial sample from
academictree.org — a crowd-sourced website listing
scientists’ mentors, trainees and collabora- 
tors — contains the names and institutions 
of about 34,800 neuroscientists, 2,000 scien- 
tists studying the fruitfly Drosophila and 1,300 
evolutionary researchers. We matched these 
authors to records in Scopus, an online data- 
base of academic papers and citation data. We 
restricted our analysis to authors who had 
accrued an h-index greater than 4 (to exclude 
inactive scientists); to publications after 1995 
(because electronic records are sparse before 
then); to authors who had published their 
first manuscript in the past 5-12 years; and 
to authors who were identifiable in Scopus. 
That left us with 3,085 neuroscientists, 
57 Drosophila researchers and 151 evolu- 
tionary scientists for whom we constructed a 
history of publication, citation and funding. 
For each year since the first article pub- 
lished by a given scientist, we used the features 
that were available at the time to forecast their 
h-index a number of years into the future. 
For example, we reconstructed how the CV 
features of a scientist looked five years after 
publishing his or her first article, and found 
a relationship between those features and the 
reconstructed h-index five years on. 
Starting with neuroscientists, we
attempted to predict the h-index of each
scientist 5 years ahead — a timescale rel-
evant for tenure decisions — using a linear
regression with elastic net regularization (see
Supplementary Information at go.nature.com/mtvuzr). The
model predicted the future h-index
accurately, yielding a respectable R² = 0.67,
cross-validated across scientists (an R² of 1
would imply that the model predicts the data
perfectly). A simplified model containing
only the number of published articles, the
h-index, years since first publication, num-
ber of publications in prestigious neuro-
science journals (Nature, Science, Nature
Neuroscience, Neuron and the Proceedings of
the National Academy of Sciences) and the
number of distinct journals still performed
nearly equally well (R² = 0.66; see ‘Predict
your future h-index’).


METRICS
Predict your future h-index

These are approximate equations for
predicting the h-index of neuroscientists in
the future. They are probably reasonably
precise for life scientists, but likely to be less
meaningful for the other sciences. Try it for
yourself online at go.nature.com/z4rroc.

Predicting next year (R² = 0.92):
$h_{+1} = 0.76 + 0.37\sqrt{n} + 0.97h - 0.07y + 0.02j + 0.03q$

Predicting 5 years into the future (R² = 0.67):
$h_{+5} = 4 + 1.58\sqrt{n} + 0.86h - 0.35y + 0.06j + 0.2q$

Predicting 10 years into the future (R² = 0.48):
$h_{+10} = 8.73 + 1.33\sqrt{n} + 0.48h - 0.41y + 0.52j + 0.82q$

Key: n, number of articles written; h, current h-index; y, years since publishing first article;
j, number of distinct journals published in; q, number of articles in Nature, Science, Nature
Neuroscience, Proceedings of the National Academy of Sciences and Neuron.


PATHS TO SUCCESS
The accuracy of future h-index prediction decreases over time, but the Acuna et al. formula predicts
future h-index better than does current h-index alone (left). The contribution of each factor to the
formula accuracy also changes over time (right). Shading indicates 95% confidence error bars.
Left panel: percentage variance explained (R²) against years ahead, for the Acuna et al. formula and for the h-index alone.
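
The three equations in ‘Predict your future h-index’ can be evaluated with a few lines of code. This is a minimal sketch rather than the authors' online calculator: the coefficients are transcribed from the box, while the function name `predict_h_index`, the table name `COEFFICIENTS` and the example scientist are assumptions made for illustration.

```python
import math

# Coefficients transcribed from the box, keyed by years ahead:
# (intercept, sqrt(n), h, y, j, q)
COEFFICIENTS = {
    1: (0.76, 0.37, 0.97, -0.07, 0.02, 0.03),
    5: (4.0, 1.58, 0.86, -0.35, 0.06, 0.2),
    10: (8.73, 1.33, 0.48, -0.41, 0.52, 0.82),
}


def predict_h_index(years_ahead, n, h, y, j, q):
    """Estimate a future h-index from five CV features.

    n: number of articles written
    h: current h-index
    y: years since publishing first article
    j: number of distinct journals published in
    q: number of articles in the five listed journals
    """
    c0, c_n, c_h, c_y, c_j, c_q = COEFFICIENTS[years_ahead]
    return c0 + c_n * math.sqrt(n) + c_h * h + c_y * y + c_j * j + c_q * q


if __name__ == "__main__":
    # Hypothetical neuroscientist: 25 articles, h-index 10, 8 years in,
    # 12 distinct journals, 2 articles in the listed journals.
    for horizon in (1, 5, 10):
        estimate = predict_h_index(horizon, n=25, h=10, y=8, j=12, q=2)
        print(horizon, round(estimate, 2))
```

Note how the weight on the current h-index falls from 0.97 to 0.48 as the horizon lengthens, while the weights on distinct journals and top-journal articles grow.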

Predicting the future careers of Dros-
ophila and evolutionary scientists leads to
somewhat worse predictions (R² = 0.54 and
R² = 0.61, respectively, based on scientists
3-15 years into their careers) but still bet-
ter than predictions based on the h-index
alone (R² = 0.38 and R² = 0.39, respectively).
This indicates that generalizations to other
fields within and outside of life science may
be limited. But for neuroscientists, at least,
the predictions extend well to longer peri-
ods of time, such as ten years into the future
(R² = 0.52). Over time, using just the h-index
performs much worse than taking all features
into account (see ‘Paths to success’, left panel).
The main five predictive features change
in importance for predicting h-indices over
increasingly longer periods (see ‘Paths to
success’, right panel). The power of the
h-index declines. The number of articles 
written, the diversity of publication in dis- 
tinct journals and the number of articles 
published in five prestigious journals all 
become increasingly influential over time. 
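
The R² values quoted in this section measure the fraction of variance in the observed h-indices that a predictor accounts for. The sketch below shows the plain definition on made-up numbers; it is an illustration only, and it does not reproduce the cross-validation across scientists that the article describes. The function name `r_squared` and the data are assumptions.

```python
def r_squared(observed, predicted):
    """Coefficient of determination: 1 - (residual SS / total SS)."""
    mean_observed = sum(observed) / len(observed)
    # Squared error left over after applying the predictions.
    ss_residual = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # Squared error of always guessing the mean of the observations.
    ss_total = sum((o - mean_observed) ** 2 for o in observed)
    return 1 - ss_residual / ss_total


if __name__ == "__main__":
    observed = [10, 14, 18]  # hypothetical future h-indices
    print(r_squared(observed, [10, 14, 18]))  # perfect predictions
    print(r_squared(observed, [14, 14, 14]))  # always guessing the mean
```

A predictor that matches every observation scores 1, and one that does no better than guessing the mean scores 0, which is the scale on which the formula and the h-index alone are compared.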


FUTURE FORTUNES 

It is risky to make any causal interpretations 
of these results. However, we will briefly 
speculate on why these features might be 
important predictors of future success. Some 
features directly affect the potential for a 
high h-index, such as the number of articles 
written. These features can also indirectly 
affect a scientist’s future success, because sci- 
entists who are productive and publish many 
papers tend to remain productive. Publish- 
ing in many different journals may lead to 
fewer overlapping populations of scientists
who cite the work, and hence higher growth
potential for articles. A scientist who has 
published in several distinct journals is also 
likely to be someone with broad training 
who contributes in many ways. The num- 
ber of publications in leading journals can 
increase the visibility of a scientist's other 
papers, past and future. 

If promotion, hiring or funding were 
largely based on indices (h-index, the model 
used here or any other measure), then some 
scientists would adapt their behaviour to 
maximize their chances of success. Models 
such as ours that take into account several 
dimensions of scientific careers should be 
more difficult for researchers to game than 
those that focus on a single measure. 

Our formula is particularly useful for 
funding agencies, peer reviewers and hir- 
ing committees who have to deal with vast 
numbers of applications and can give each 
only a cursory examination. Statistical 
techniques have the advantage of returning 
results instantaneously and in an unbiased 
way. Building and analysing massive data 
sets to track scientific careers could also help 
to identify potential gender, racial and other 
biases and advance our understanding of
how science develops. 

Although our findings and predictions 
may not alleviate scientists’ angst over their 
careers, the results offer some comfort by 
showing that the future is not so random. 
The occasional rejection of a paper may feel 
unjust and indiscriminate, but in the long run, 
such factors seem to average out, rendering 
h-index trajectories relatively predictable.


SOURCE: J. M. BERG/NIGMS


Well-funded investigators should receive extra scrutiny


An added layer of review for elite grant-holders upholds the mission 
of the National Institutes of Health, says Jeremy M. Berg. 


The US National Institutes of Health
(NIH) receives many more grant
applications proposing outstanding
scientific projects than its budget can sup-
port. The overall success rate for grants in
fiscal year 2011 was 18% — a historic low.

Last month, the agency announced a policy 
that will apply to the next round of R01 grants.
Applications from investigators who receive 
more than US$1 million a year in funding 
from the NIH (not including ‘indirect costs’ 
that go to the investigator's university) will be 
subjected to extra review. The agency's advi- 
sory councils will examine how distinct the 
proposed project is from the investigator’s 
other funded work. If it is a grant renewal, 
they will consider whether the project has 
been productive, and look at its value to 
the investigator's research programme and 
collaborations. This analysis will be provided 
to NIH programme staff, who will make 
funding recommendations. 

The policy has been criticized for adding 
to the NIH’s administrative burden, and for 
potentially penalizing the most productive 
scientists. However, I believe that it will com- 
plement peer review and help to determine 
the best possible portfolio of NIH-funded 
research, short and long term. 

The National Institute of General Medical 
Sciences (NIGMS) in Bethesda, Maryland 
— where I was director between 2003 and 
2011 — has, for around two decades, given 
extra scrutiny to applications from investiga- 
tors receiving more than $750,000 per year in 
costs from all funding sources, including the 
pending NIGMS application. This policy is 
not a cap; rather, it is a useful tool for manag-
ing taxpayer resources to achieve the goals
of the NIGMS and the NIH. The scrutiny 
has influenced the distribution of resources, 
either by denying or reducing funding for 
well-off applicants, or providing grants on 
the condition that others will not be renewed. 
This frees up money for other investigators 
and allows those who are well-funded to 
change the course of their research. Assum- 
ing that the NIH policy can be implemented 
effectively, it is a step in the right direction. 

Critics of the NIH policy argue that 
although peer review is not perfect, it is the 
best system for evaluating scientific projects. 
Many reviewers try to ensure that a new pro- 
ject does not overlap significantly with one 
that is already funded. But they do not have
full access to NIH portfolio data, nor the
time to analyse the information.

Peer review has several limitations. To 
compare and rank grant applications across 
hundreds of NIH sections, administrators 
assign each peer-review score a percentile 
value. The first percentile (the top 1%) corre- 
sponds to the most highly rated applications. 
These percentile values have uncertain-
ties that stem from the potential biases of
primary reviewers, the characteristics of
the process used to calculate percentiles,
and the fact that an application is judged by
whether it will be an important future scien-
tific advance — which is inherently difficult
to predict.


SCORE DRAW
Higher grant scores do not imply more papers.
Renewal grants show no significant differences
up to the 13th percentile; new grants showed
no significant correlation over the entire range.


BANG FOR BUCK 

During my time as director of the NIGMS, 
I analysed the power of peer-review scores 
to predict scientific productivity. This was 
judged by various measures, including num- 
bers of publications, citations and highly cited 
publications four years after the grants were 
funded. The distribution of percentile scores 
for applications did correlate with future 
differences in productivity to some extent. 
But applications that were separated by five 
or even ten percentile points did not differ 
significantly in subsequent productivity (see 
‘Score draw’). Therefore, funding grants in 
strict percentile order does not necessarily 
mean that ensuing investigations will be of 
higher quality. So percentile score should not
be the only factor taken into consideration
when making funding decisions. 

There are good reasons to consider how 
much other funding an investigator is 
receiving. Projects in one lab almost always 
overlap, even if they are addressing different 
questions, so some of the running costs will 
already be covered. Giving another grant to 
the same investigator might have less impact 
than giving it to someone with nearly the 
same percentile score who has little or no 
other funding. In another NIGMS analysis, I 
showed that research productivity (measured 
by numbers of publications or citations and 
averaged over groups of investigators with 
similar levels of support) did not increase 
consistently as the level of support increased, 
but reached a plateau near $700,000 in annual 
direct costs per investigator. 

Some investigators are better than others 
at managing large amounts of resources. If 
an investigator is not performing as expected 
given the level of support, using the funds to 
support another researcher might be a bet- 
ter investment. The NIH needs to invest in a 
broad range of research; if scientists are well 
funded, their area could already be receiving 
enough support from other sources. 

The layer of extra scrutiny is not ideal. 
I would prefer the limit to be less than 
$1 million and to include funding outside the 
NIH. Exceptions to the policy also mean that 
it might be easy to dodge. For example, grants 
with multiple principal investigators are scru- 
tinized only if all receive more than $1 mil-
lion in NIH funding, so I am concerned that
some applicants may avoid extra review by 
adding a co-investigator who receives less. 

I believe that special consideration should 
be given to investigators with strong propos- 
als who have few or no other sources of fund- 
ing, such as those at the beginning of their 
careers or established, productive investiga- 
tors. Funding these applicants would prob- 
ably have a bigger impact — by helping to 
develop a new lab or keeping an effective 
one functioning — rather than providing 
incremental support to an investigator who 
already has substantial other support.


COMMENT | BOOKS & ARTS


Speculative fiction from 1904 to 1933 is often overshadowed by later ‘golden age’ work. 


SCIENCE FICTION 


The radium age 


Joshua Glenn explores the dark, fascinating, largely 
forgotten science fiction of the early twentieth century. 


We think we know science fic-
tion. There were the ‘scientific
romance’ years that stretched
from around the mid-1860s to 1903, after
which H. G. Wells lost his touch. And there 
was the ‘golden age’ from 1934 to the mid- 
1960s. But between those periods — and 
overshadowed by them — was a fascinating 
era that gave us such enduring memes as the 
berserk robot, the tyrannical superman and 
the sinister telepath. 

I call that period, from 1904 to 1933, sci-fi’s 
‘radium age’. It emerged when the speed
of change in science and technology was 
inducing vertigo on both sides of the Atlantic. 
More cynical than its Victorian precursor 
yet less hard-boiled than the generation 
that followed, this is sci-fi offering a dizzy- 
ing, visionary blend of acerbic social com- 
mentary and shock tactics. It yields telling 
insights into its context, the early twentieth 
century. Plus, it is fun to read. 

NATURE.COM: For more on science fiction, see the Nature Web Focus: go.nature.com/mgc2j

The scientific romances that preceded
this era were strong on the fantastical
but otherwise very different. The utopian
fantasies of Edward Bulwer-Lytton (The 
Coming Race, 1871), for instance, and the fan- 
tastic voyages by writers such as Jules Verne 
(Journey to the Centre of the Earth, 1864) and 
Edgar Allan Poe (The Unparalleled Adven- 
ture of One Hans Pfaall, 1835) were largely 
sentimental literature. They were by and for 
people who were inspired by US inventor 
Thomas Edison and believed that modern 
science and technology would improve the 
human condition and usher in the triumph
of millennia-old Christian values.

Radium-age writers were not out to 
beguile. They depict a human condition 
subverted or perverted by science and tech- 
nology, not improved or redeemed. Aldous 
Huxley’s 1932 Brave New World, with its dev- 
astating satire on corporate tyranny, behav- 
ioural conditioning and the advancement of 
biotechnology, is far from unique. Radium- 
age sci-fi tends towards the prophetic and 
uncanny, reflecting an era that saw the rise 
of nuclear physics and the revelation that the 
familiar — matter itself — is strange, even 
alien. The 1896 discovery of radioactivity, 
which led to the early twentieth-century 
insight that the atom is, at least in part, a 
state of energy, constantly in movement, is 
the perfect metaphor for an era in which life 
itself seemed out of control. 

As demonstrated by Philipp Blom in The 
Vertigo Years: Europe 1900-1914 (Basic 
Books, 2008), the actual experience of time 
and space was altered in this period by the 
proliferation of a range of stunning techno- 
logical developments, from the telegraph and 
the telephone to aeroplanes and cinematic 
film. Thinking and perception were radi- 
cally reoriented by developments in science, 
philosophy and the arts. These included Ein- 
stein’s special theory of relativity; the work 
of phenomenological philosophers such as 
Edmund Husserl, who sought to objectively 
describe the subjective workings of con- 
sciousness; and Cubism, the artistic move- 
ment spearheaded by Pablo Picasso and 
Georges Braque that explored distortion and 
deconstruction. Social and cultural forms 
and norms, such as women’s subordination to 
men — once regarded as natural, permanent 
‘elements’ — were disintegrating. 

During this astonishing period, sci-fi 
writers were obsessed with the future. 
The scientific romantics had conjured up 
simplistic utopias that remained firmly 
grounded in contemporary realities. By con- 
trast, the radium-age novels, stories, movies 
and plays (see ‘Key radium-age works’) often 
lift off into previously unexplored realms. 

NOVEL VISIONS

Key radium-age works

Karel Capek R.U.R. (1920)
Arthur Conan Doyle The Lost World (1912)
Hugo Gernsback Ralph 124C 41+ (1911-12)
H. Rider Haggard When the World Shook (1919)
William Hope Hodgson The Night Land (1912)
Aldous Huxley Brave New World (1932)
Rudyard Kipling With the Night Mail (1905)
Jack London The Scarlet Plague (1912)
Charlotte Perkins Gilman Herland (1915)
Edward Shanks The People of the Ruins (1920)
George Bernard Shaw Back to Methuselah (1921)
Olaf Stapledon Last and First Men (1930)
Thea von Harbou Metropolis (1926)
Philip Wylie Gladiator (1930)
Yevgeny Zamyatin WE (1921)

Instead of offering progressive solutions
to social unrest, writers satirized and exaggerated
its causes and effects. The out-of-
control robot can be read as a criticism of 
the efficiency-oriented theories of Frederick 
Winslow Taylor and the practices of industri- 
alist Henry Ford. So R.U.R. (Czechoslovakian 
writer Karel Capek’s 1920 play, which intro- 
duced the term ‘robot’ to the English lan- 
guage) portrays mass production as alienating 
at best. In Jack London's post-apocalyptic The 
Scarlet Plague (1912), a race of barbarians 
descended from San Francisco's brutalized 
underclass roam the city’s devastated remains 
after the fatal pandemic of 2013. And Char- 
lotte Perkins Gilman’s feminist novel Herland 
(1915) imagines an ideal community in which 
women aren't merely emancipated, but have 
done away with men altogether. 

Politics are inevitably part of the mix — 
this is an era that encompassed the First 
World War, the Russian revolutions, and 
the rise of radical left- and right-wing move- 
ments. Many sci-fi authors, such as Capek 
and Yevgeny Zamyatin, were leftists and lib- 
erals. But more conservative authors shared 
their utopianism and cynicism. For instance, 
Arthur Conan Doyle, inventor of Sherlock 
Holmes, wrote a series of ripping yarns star- 
ring a Professor Challenger, who discovers 
surviving dinosaurs, travels to Earth’s “sen- 
sory cortex” and witnesses the end of life on 
the planet — all the while making the case 
for reconciling imagination and intuition 
with a sceptical scientific method. 

Similarly, H. Rider Haggard, who had 
made his name with King Solomon’s Mines
(1885) and the Quatermain novels of the
1880s, created a disputatious trio in 1919’s 
When the World Shook — an idealistic 
Anglican minister, a sardonic doctor and 
an adventurer whose world view hovers 
somewhere in between. Meanwhile, both 
the leftist London (in The Scarlet Plague) 
and the conservative English poet Edward 
Shanks (in The People of the Ruins) agree that 
the destruction of modern western society 
wouldn't be an entirely bad thing. 

Fans of Philip K. Dick or Ursula K. Le Guin 
— writers belonging to what literary theorist 
Fredric Jameson has termed the “anti-anti- 
utopian” trend of the late 1960s and early 
1970s — will find provocative antecedents 
here. Reading the dangerous visions of 
radium-age sci-fi, published in times as vol- 
atile as our own, destabilizes everything we 
take for granted. These books remind us that 
we need to regard our twenty-first-century 
forms and norms without sentimentality.


Joshua Glenn edits the blog HiLobrow, and 
co-founded HiLoBooks to reissue ‘radium 
age’ science fiction. HiLoBooks’ edition of
Arthur Conan Doyle’s The Poison Belt is out
now; H. Rider Haggard’s When the World
Shook is published in October. 

e-mail: jglenn@hilobrow.com 


Books in brief 


Higgs Discovery: The Power of Empty Space 

Lisa Randall BODLEY HEAD 64 pp. £4.99 (2012) 

The Higgs bombshell on 4 July rocked the world of physics. In
this slim volume, theoretical physicist Lisa Randall analyses
the significance and implications of that momentous finding at
Switzerland’s Large Hadron Collider. She offers clear accounts of 
the Higgs mechanism and the role and modes of the Higgs’s decay; 
follows the seven-month lead-up to the discovery; and speculates 
about what it all might mean for other areas of exploration, such as 
supersymmetry. A lucid, deft and engaging summation of dogged 
determination and “heroic engineering”. 


The Human Quest: Prospering Within Planetary Boundaries 

Johan Rockström and Mattias Klum STOCKHOLM TEXT 314 pp. $9.99 (2012)
In 2009, Johan Rockström of the Stockholm Resilience Centre
and his colleagues set out in Nature nine ‘planetary boundaries’ — 
numerical limits for processes that affect Earth’s capacity to support 
human life, such as freshwater use and climate change. This lavishly 
illustrated e-book — with a foreword by former US president Bill 
Clinton and video clips from photographer Mattias Klum — extends 
this idea, laying out pressures and tipping points. Paramount among 
the changes needed, Rockström says, is a big shift in behaviour.


Lost Antarctica: Adventures in a Disappearing Land 

James McClintock PALGRAVE MACMILLAN 256 pp. £16.99 (2012) 

The sight of 50,000 king penguins on the Crozet Island Archipelago 
in the early 1980s sparked marine biologist James McClintock’s 
fascination with Antarctic fauna. Now a veteran of the extreme south, 
McClintock shares the otherworldly wonders unveiled by decades 
of research. The book is packed with joys, from soft-coral ‘trees’ that 
replant themselves to the snoozing Weddell seal, stinking of putrid 
fish, that the author encountered in a dive hut. Running like a chill 
current through all is the climate-driven decimation of the ice on 
which these ecosystems depend. 


Before Galileo: The Birth of Modern Science in Medieval Europe 
John Freely OVERLOOK PRESS 352 pp. £18.51 (2012) 

A thousand years before Galileo, the transmission of knowledge that 
survived the burning of the ancient Library of Alexandria began. 
Physicist John Freely traces this “tenuous Ariadne’s thread” of 
classical learning that unspooled from Egypt through Byzantium 
and the Islamic world, finally emerging as Latin texts. Focusing on 
the trailblazers through this extraordinary millennium — from Bede, 
Averroës and al-Khwarizmi to Adelard of Bath, Robert Grosseteste
and Roger Bacon — Freely ends with a coda on Copernicus, Kepler, 
Galileo himself and Newton. Shoulders of giants indeed. 


The Scientists: An Epic of Discovery 

edited by Andrew Robinson THAMES & HUDSON 304 pp. £24.95 (2012) 
The human face of scientific breakthroughs from the sixteenth to 
the twentieth centuries is spotlit in this sumptuously illustrated 
volume. Science writer Andrew Robinson, editing contributions from 
a stellar team of authors, groups 43 greats into six broad areas: 
Universe, Earth, Molecules and Matter, Inside the Atom, Life, and 
Body and Mind. From Alan Turing and Marie Curie to William Harvey 
and Chandrasekhar Venkata Raman, this is a sampler of the driven, 
complex, fascinating characters who fomented scientific revolutions. 
James Balog has developed new photographic equipment to monitor changes in glaciers such as the Ilulissat in Greenland.


Q&A James Balog 


Archivist of ice 


For six years, photographer James Balog has trained his lens on ice, capturing time-lapse 
images that have helped scientists to study how glaciers and ice sheets respond to climate 
conditions. With the documentary Chasing Ice soon to debut in US cinemas, Balog talks
about the loss of landscapes.


Why are you interested in frozen landscapes?

When I was six years 
old, I had to walk 
home from school in 
a heavy snowstorm. 
It was a great contest 
between myself and 
nature. As a young man, I started ice climb- 
ing and mountaineering in the White Moun- 
tains in New Hampshire. When I looked at 
photographs of glaciers in Alaska I knew I 
wanted to spend time there. 


Portraits of Vanishing Glaciers
Rizzoli International: 2012. 288 pp. £29.95

Chasing Ice
Jeff Orlowski: 2012


When did you start photographing glaciers in time lapse?

After being blown away by the glacial retreat
I saw while shooting a National Geographic
cover story, ‘The big thaw’, in 2006. The
next year I created the Extreme Ice Survey, 
deploying time-lapse cameras to monitor 
retreating ice in the Arctic and mountain 
areas, some so remote that our team was 
probably the first to visit them. Tempera- 
tures could reach -40 °C and the winds 
150 miles per hour. We had to tackle deep 
snow, torrential rain and falling rocks. Off- 
the-shelf gear wasn't robust enough, and it 
took six months of experimenting to come 
up with a reliable time-lapse system. Right 
now, we have 27 such cameras at 18 glaciers 
in Alaska, Greenland, Iceland, the Rocky 
Mountains and at Mount Everest. Shooting
every half-hour of daylight year-round,
each one generates 8,500 frames per year. 
The footage provides scientists with infor- 
mation on the mechanics of glacial melting 
and gives the public evidence of how rapidly 
Earth’s climate is changing. 


What is the most dramatic moment you’ve caught?

Every year there are calving events in which 
ice falls off glaciers into the sea. The rate of 
ice loss in Greenland has doubled during 
the past 20 years, and this summer we've 
seen unprecedented rates of surface melt- 
ing. We expected the Ilulissat Glacier on the 
west coast of Greenland to have a massive 
discharge of ice, so in summer 2008 two of 
my team members stood watch for weeks. 
They caught the biggest calving event ever 
recorded on film. It was as if the entire lower 
tip of Manhattan had broken off and fallen 
into the ocean, like an urban skyline top- 
pling, with skyscraper-sized blocks of ice 
submerging and resurfacing. 


What’s the aim of the Extreme Ice Survey? 

The problem of climate change is percep- 
tion. Human brains and our economic sys- 
tem favour the status quo. We aim to collect 
powerful visual evidence so that people 
can understand what's really happening. 


J. BALOG/EXTREME ICE SURVEY 
When satellites capture retreating
glaciers from hundreds of kilometres 
away, the images may be beautiful but 
they’re removed from human experi- 
ence. We live on the ground. Retreating 
glaciers are where you can see climate 
change as it happens. 


What have scientists learned from your images?

In 2006, on the Greenland Ice Sheet, I 
met glaciologists who said that glaciers 
are big lumbering things that respond 
to climate change on a scale of decades
or centuries. We have learned that gla- 
ciers respond hourly to atmospheric 
conditions. Our visuals also suggest 
that subglacial floods may trigger some 
of the bigger calving events. We have 
nearly a million pictures in the archive 
now, although there’s a lot of analysis yet 
to do. 


How do you fund the ice survey? 

A significant part of our initial fund- 
ing was from the Expedition Council 
at National Geographic. Nikon gave us 
hardware. By partnering with scientists, 
we got help from the US National Sci- 
ence Foundation and NASA. But most of 
the funding over the past five years has 
been from private donors. It has been an 
absurdly difficult project to run without 
‘big science’ government-scale funding 
and logistical support. 


Do you have a science background? 

For my master’s in geomorphology, I 
researched Colorado’s Big Thompson 
River flash flood of 1976. But I would not 
presume to call myself a scientist. When 
I was finishing my thesis, I remember 
looking at a stack of manila punchcards 
and deciding that I'd rather see the world
through a camera than through data 
analysis. The data are incredibly impor- 
tant, but my calling is to understand the 
world through art. 


What next? 

The underlying theme of my work is 
the collision between human needs and 
nature. I'd like to do more with energy, 
in part because my grandfather died in 
a Pennsylvania coalmine collapse. We 
worked in the Gulf of Mexico after the 
Deepwater Horizon oil spill and are 
documenting changes in forest cover in 
the Rockies. And I am trying to make the 
ice survey financially stable so that the 
cameras can keep going for a long time. I
feel a tremendous obligation to preserve 
a pictorial memory of these vanishing 
landscapes for the people of the future. 
Modelling Feynman


Daniel Cressey marvels at a gleaming depiction of the 
subatomic by the world’s leading information designer. 


Before the explosion in ‘infographics’
describing everything from cocktail
mixology to US health-care spending,
there was Edward Tufte.

Tufte, a statistician, is perhaps the world’s 
most celebrated information designer. In All 
Possible Photons at his Manhattan gallery, 
ET Modern, Tufte has unleashed his love of 
artistic explanation in a series of sculptures 
instantly recognizable to anyone with a pass- 
ing knowledge of particle physics. Minimalist 
‘graphics’ consisting of stainless-steel tubing 
formed into straight and undulating lines, 
Tufte’s rendering of Feynman diagrams 
transforms recondite scientific notation into 
abstract, glinting art. Tufte also plans to show 
versions more than five metres high at his 
sculpture park in Woodbury, Connecticut. 

US physicist Richard Feynman created 
elegant traceries of lines, dots and arrows to 
describe and predict how subatomic parti- 
cles interact. Feynman was not unaware of 
his diagrams’ aesthetic appeal, and famously 
drove a van painted with a selection of them. 

Tufte’s matt or polished steel sculptures, 
mounted on the walls, are shorn of explana- 
tions as well as much of the detail that makes 
them scientifically useful, such as arrows and 
labels. Some are large and dominate their wall 
space. The most powerful artwork on display 
is the collection of 120 smaller pieces that give 
the show its name — a cluster representing all 
possible space and time paths of a particular
interaction of photons. These form what Tufte
calls “a complete enumeration of everything
that could happen” in that instance.

All Possible Photons: The Conceptual and Cognitive Art of Feynman Diagrams
ET Modern, New York.
Until spring 2013; official opening 15 September.

This isn’t the first time Tufte has ven- 
tured into Feynman territory. The diagrams 
feature as models of good design in Tufte's 
book Beautiful Evidence (Graphics, 2006). 
They are also referenced in one of his enor- 
mous Rocket Science sculptures of fantastical 
space probes attached to giant bazooka- 
style launchers. In the Airstream Interplan- 
etary Explorer (2011), the probe is an iconic 
silvery Airstream caravan adorned with 
Feynman diagrams. Tufte’s contention is that 
because subatomic particles everywhere in 
the Universe behave as shown by Feynman 
diagrams, these could work as communi- 
qués to other worlds. As he has put it, “Bet- 
ter the cosmopolitan verbs of Nature’s laws 
on spacecraft than the local proper nouns of 
national flags, earthly Gods and Goddesses, 
and government agency logos.” 

By focusing on the diagrams alone, the 
sculptures in All Possible Photons bring home 
the power of Feynman’s achievement. There 
is beauty in his diagrams, but the real deal is 
what they can potentially describe — which 
is everything.


Daniel Cressey is a reporter for Nature. 




Edward Tufte adjusts his All Possible 6-Photon Scattering (120 Space-Time Feynman Diagrams) (2012). 
Clarify the limits of
climate models 


Mark Maslin and Patrick 
Austin suggest that scientists 
should explain the statistical 
uncertainties in models of 
climate-change impacts (Nature 
486, 183-184; 2012), but these 
models are tools for insight, 
not prediction. A diversity of 
models benefits both science and 
science-based policy guidance. 
We believe that future models 
will be more informative than 
those of today. The public-image 
problem of current models stems 
partly from scientists’ failures to 
identify the limitations openly. 
It is important to distinguish 
between questions for which 
current models are useful as 
prediction engines and those for 
which the models merely probe 
possibilities. The role of science is 
to reflect on the plausibility and 
relevance of such possibilities. 
Deep uncertainty is not foreign 
to policy-makers, who often 
have to weigh the advantages 
of deciding to wait against the 
potentially high costs of waiting. 
David A. Stainforth, 
Leonard A. Smith London 
School of Economics, UK. 
d.a.stainforth@lse.ac.uk 


ENCODE leads the 
way on big data 


The ENCODE project offers 
a fresh perspective on big data 
by providing an organized 
framework for genomics (www. 
nature.com/encode). Other 
big-data efforts tend to focus 
on rapidly locating needles in 
petabyte-sized haystacks (such 
as finding the Higgs boson, for 
instance), whereas ENCODE aims 
to supply a structured overview. 
ENCODE’s organization of
information is hierarchical, 
with raw data at the bottom and 
layers of annotation above. The 
processed summaries become 
progressively broader — for 
example, starting at the level of 
signals representing the degree 
to which DNA is bound by
transcription factors, moving
on to the locations of sites where
these factors bind, and then to 
overviews of regulatory networks. 
At the summit are the linked 
publications documenting the 
annotation. 

The ENCODE data model 
could be useful in other fields: 
for example, astronomy and 
Earth science are in the process 
of organizing their reams of data 
(M. J. Raddick and A. S. Szalay
Science 329, 1028-1029; 2010), 
but don't yet compare with 
ENCODE in terms of the level of 
integration. 

Mark Gerstein Yale University, 
New Haven, Connecticut, USA. 
mark.gerstein@yale.edu 


Animal variants in 
sexual behaviour 


Andrew Barron and Mark 
Brown suggest that journalists, 
by merging animal and human 
behaviour, promote the myth that 
minority sexual preferences are 
an ‘illness’ (Nature 488, 151-152; 
2012). They include one of my 
articles among their examples 

— Bat bugs turn transsexual 

to avoid stabbing penises’ (see 
go.nature.com/9iwdpb) — 
presumably because it uses 
‘transsexual’ to describe insect 
behaviour. I contend that this 
term is valid shorthand for what 
the insects are doing, as well as 
catching the reader’s attention. 

Reporting of same-sex 
behaviour in animals can stray 
into troubling territory. In one 
example given by Barron and 
Brown, The Sunday Times wrote 
in 2006 that a study of male- 
oriented behaviour in sheep 
“could pave the way for breeding 
out homosexuality in humans”. 
However, the problem here is 
not the terms used to describe 
animals’ sex lives, but the leverage 
of the research paper to make a 
political point. 

Homosexual behaviour is 
condemned as unnatural in 
some places around the world. 
By describing the huge variation 
in sexual behaviour across the 
animal kingdom, the media may
be helping to destroy forever the 
idea that gay, lesbian, bisexual 
and transgender people represent 
some kind of aberration. 

Rowan Hooper London, UK. 
rowan.hooper@newscientist.com


A code of conduct for 
European scientists 


Progress in safeguarding research 
integrity is lamentably slow in 
Europe compared with the United 
States (Nature 488, 7; 2012). Many 
European universities are still 
unsure of their responsibilities in 
policing research conduct. 

Punishing misconduct makes 
no sense without prevention, 
given that one study found that 
up to one-third of all researchers
engage in questionable research 
practices (D. Fanelli PLoS ONE 
4, e5738; 2009). What is needed 
is for each research institution 
to provide a detailed standard of 
responsible research conduct, 
along with mandatory education, 
so that researchers know how to 
uphold this standard in day-to- 
day practice. 

We suggest that a European 
forum should be set up to educate 
researchers and their mentors 
in all matters relating to this 
standard. The forum would 
develop educational material, 
train and accredit teachers and 
remove discrepancies between 
research institutions. 

Every university would need 
to set up a ‘research integrity 
unit’ to guide and supervise its 
research activities, and to ensure 
compliance with international 
standards. 

We urge public and private 
research funders to lend their 
support to this venture, which 
would help to free European 
research from questionable 
research practices and make life 
harder for fraudsters. 

Nils Axelsen Statens Serum 
Institut, Copenhagen, Denmark. 
Xavier Bosch Department of 
Internal Medicine, Hospital 
Clinic, University of Barcelona, 
Spain. xavbosch@clinic.ub.es 


Wallace pioneered 
astrobiology too 


The field of astrobiology, called 
‘exobiology’ during the space 
programmes of the 1960s 
(Nature 488, 160; 2012), was 
anticipated six decades earlier 
by Alfred Russel Wallace, better 
known as the co-discoverer of 
the principle of natural selection. 

Next year is the centenary 
of Wallace’s death, when 
his seminal contribution to 
biogeography and evolution will 
be celebrated. But he should also 
be remembered as a pioneer in 
astrobiology, whose hypotheses 
are still alive today. 

Wallace introduced the 
concept of “astro-biology” in 
his popular book Man’s Place in
the Universe (Chapman & Hall, 
1903). He reviewed the physical 
conditions required for organic 
life in terrestrial ecosystems and 
concluded that Earth is the only 
habitable planet in the Solar 
System. 

He later published a 
monograph (Is Mars Habitable? 
Macmillan, 1907) evaluating 
astronomer Percival Lowell’s 
suggestion that Mars could be 
“inhabited by a race of highly 
intelligent beings” (see Nature 74, 
587-589; 1906). After analysing 
what was then known about the 
Martian climate, temperature, 
possible presence of water and 
the ‘canals’ thought by Lowell to 
indicate intelligent life, Wallace 
roundly refuted this idea. 

U. Kutschera Institute of Biology, 
University of Kassel, Germany. 
kut@uni-kassel.de 


CORRECTION 

The obituary of Andrew 
Fielding Huxley (Nature
486, 474; 2012) omitted to
mention that Hugh Huxley and 
Jean Hanson also proposed 
a sliding-filament model of 
muscle contraction, based on 
myofibril experiments. Their 
work was published (Nature 
173, 973; 1954) alongside 
A. F. Huxley’s Nature paper. 


NEWS & VIEWS 


Mixed results for dieting monkeys 


According to previous studies, a low-calorie diet provides health benefits and increases lifespan in mammals, including 
primates. Yet a long-term investigation in rhesus monkeys finds no effect on longevity. SEE LETTER P.318 


STEVEN N. AUSTAD 


For more than 75 years, we have known
that reducing food consumption
increases longevity and preserves various
aspects of health in mice and rats. For
many researchers, these observations offered
an investigational tool for probing underlying
mechanisms of the ageing process. For
other researchers and for the public at large,
however, the observations suggested the more
immediate hope that calorie restriction might
improve health and extend life in humans:
the fountain of youth could be as close as one’s
dinner plate.

To address whether such simple dietary
intervention might have beneficial effects in
primates that are long-lived like humans, two
independent studies using rhesus monkeys
were initiated in the late 1980s, one at the US
National Institute on Aging (NIA) in Bethesda,
Maryland, and the other at the Wisconsin
National Primate Research Center (WNPRC)
in Madison. Periodic reports from the studies
have provided some indications of improved
health’* and, possibly, increased lifespan’ in
the animals. However, on page 318 of this
issue, Mattison et al.* report that the NIA data
indicate that calorie restriction has no effect on
the animals’ longevity*.

In the NIA study, rhesus monkeys of different
ages were subjected to a 30% calorie
restriction (Fig. 1). The authors grouped the
data into a young-onset group (animals that
were 1-14 years old when enrolled in the
study) and an old-onset group (aged 16-23
when enrolled). To give perspective to these
ages, sexual maturity in captive rhesus monkeys
occurs at 4-5 years of age, and median
longevity had previously been reported”
as 27 years. Mattison et al. found that, for
both age groups, the treated animals lived
no longer than untreated monkeys. Nearly
50% of the young-onset animals are still
alive, but the researchers’ projections from
current survival patterns indicate a likelihood
of less than 0.1% that the calorie-restricted
animals will survive longer than the controls.

Figure 1 | Thinner but not longer lived. Mattison et al.’ report that restricting calorie intake in rhesus
monkeys does not increase survival, although it improves certain metabolic parameters in some of
the animals. The image shows two 27-year-old male monkeys, one of which had been placed on a
calorie-restricted diet (left), with the other being untreated. (Housing for photo purposes only.
Photo courtesy of the National Institute on Aging/NIH.)

*This article and the paper under discussion’ were
published online on 29 August 2012.

To explore possible health benefits of the
treatment, the authors regularly took blood
samples from the animals to measure levels
of glucose, cholesterol and triglyceride lipids. 
Cholesterol levels were significantly lower in 
treated males than in untreated males, but were 
unaffected in females. Moreover, the calorie- 
restricted old-onset monkeys had significantly 
lower triglyceride levels, as well as marginally 
lower glucose, than the controls. However, the 
young-onset treated animals showed no dif- 
ference in glucose concentrations, and their 
triglyceride levels were marginally lower in 
males only, when compared with untreated 
controls of the same age group. 

Calorie restriction did not alter causes of 
death. It may be premature to predict the final 
effect on ageing-related diseases because half 
of the animals in the young-onset group are 
still alive, but the dietary intervention seems 
to have reduced the incidence of cancer and, 
possibly, diabetes — although it might have 
slightly increased the incidence of cardio- 
vascular disease. All of these ageing-related 
diseases did appear marginally later in the 
restricted monkeys, however. 

Mattison and colleagues’ results contrast 
in some ways with those reported for the
WNPRC study’, which enrolled only young
rhesus monkeys (7-14 years old). The
WNPRC researchers found no statistically
significant effect of calorie restriction on 
overall longevity, but the survival patterns 
(with roughly 50% of the animals still alive) 
suggested a clear trend towards longer life for 
the treated subjects. Moreover, when deaths 
due to non-ageing-related causes such as gas- 
trointestinal bloat or anaesthesia (which was 
required for some experimental procedures) 
were eliminated from the WNPRC analy- 
sis, calorie restriction did show a statistically 
significant, positive effect on survival. 
However, some of us in the field were con- 
cerned about the interpretation of the WNPRC 
study. The deaths that had been eliminated 
from the analysis made up nearly half of the 
total deaths, and there were more of such 
deaths in the restricted group than in the con- 
trol group. So, it was possible that some of the 
non-ageing-related deaths might be linked to 
the treatment. Interestingly, Mattison and colleagues’
data also indicate a higher incidence of
non-ageing-related deaths in calorie-restricted
young-onset animals than in controls. 

What might account for the differences 
between the studies’ results? One possibil- 
ity is diet composition. Whereas the diets 
were broadly similar in their overall con- 
tent of carbohydrates, proteins and fats, 
they differed in the specific types of such 
nutrients. For instance, sucrose made up 
28.5% of the WNPRC diet, but only 3.9% 
of the NIA diet. Possibly related to this dif- 
ference, more than 40% of the WNPRC 
control animals and only 12.5% of the NIA 
controls developed diabetes — although 
this metabolic malfunction was completely 
absent in the WNPRC calorie-restricted 
animals, but not in the treated animals in 
the NIA study. 

Another difference is that the NIA controls 
were given an apportioned amount of food to 
prevent obesity, whereas the WNPRC con- 
trols were fed ad libitum (that is, they could 
eat as much as they pleased). Consequently, 
NIA control animals weighed less and were 
considerably longer-lived than the WNPRC 
controls. One interpretation of this observa- 
tion is that the NIA controls were partially 
restricted, which would account for the lack 
of a survival effect of the treatment in the NIA
study. Nevertheless, all animals in both stud- 
ies — even in the calorie-restricted groups — 
weighed more than wild-caught monkeys’. 

Taken together, the contrasting results raise 
an intriguing question about the nature and 
robustness of restricting calorie intake in pri- 
mates. Is calorie restriction anything more 
than the elimination of excess fat? That might 
be concluded if one interprets the control ani- 
mals in the NIA study as being restricted to 
a healthy weight, such that further restriction 
had little additional effect on longevity. How- 
ever, researchers who use rodent models have 
long assumed that calorie restriction results 
in more than just leanness and that it extends 
life beyond its normal limits. It is known, for 
instance, that calorie restriction improves sur- 
vival in obese and non-obese mice and rats’, 
even at very high restriction levels®, and that 
survival is affected by calorie restriction in a 
different way from exercise-driven leanness’. 

The same question arises in ongoing 
studies comparing overweight'”"” or lean’* 
people with normal-weight controls. If calorie 
restriction defines only the food intake needed 
to maintain a healthy body weight, then phar- 
macological mimicry of such an effect might 
improve the health of only the fraction of the 
population that is overweight. And, in that 
case — somewhat disappointingly — no spec- 
tacular increase in health or longevity should 
be expected.


Steven N. Austad is at the Barshop 
Institute for Longevity and Aging Studies, 
Department of Cellular and Structural 
Biology, University of Texas Health Science
Center at San Antonio, San Antonio,
Texas 78245, USA. 
e-mail: austad@uthscsa.edu 

Drought and tropical
soil emissions 


Past research implied that positive feedback might exist between climate change 
and greenhouse-gas emissions from soil. A study finds that drought-induced 
declines in such emissions from tropical forests could counter climate change. 


CORY C. CLEVELAND 
& BENJAMIN W. SULLIVAN 


Tropical forests have been aptly described
as the planet's lungs. Tropical vegetation
inhales vast amounts of carbon dioxide
and exhales oxygen to the atmosphere, and 
in so doing, provides a vital climate benefit 
by absorbing some of the nearly 9 petagrams 
(1 petagram is 10¹⁵ g) of carbon that human
activities release into the atmosphere each year’.
What's more, tropical soils influence climate by
releasing large amounts of CO₂ as a by-product
of the decomposition of organic matter, and 
through several other soil microbial processes 
that both emit and consume greenhouse gases, 
including nitrous oxide and methane’. 

Important questions remain about the 
effect of climate change on greenhouse-gas 
fluxes from soil, and hence to what extent the 
beneficial climate services provided by tropi- 
cal forests will persist in the future. Writing 
in Global Biogeochemical Cycles, Wood and 
Silver’ report that an experimentally induced 
drought had reduced greenhouse-gas emis- 
sions from tropical soil to the atmosphere, an 
effect that represents a negative feedback to 
climate change. 

Many of the scenarios used to predict future 
climate focus on rising temperature, but 
changes in the water balance of tropical forests 
might have more immediate and pronounced 
effects. Although substantial uncertainty 
remains, most climate models predict signifi- 
cant declines in rainfall across large areas of 
the tropics*”. So how might such declines alter 
the net balance of the gases that contribute to 
climate change? 

To address this question, Wood and Silver 
manipulated rainfall in a Puerto Rican tropical 
forest (Fig. 1) to test how drought affects soil 
emissions of CO₂, methane and nitrous oxide.
Such experimental manipulations remain the 
gold standard for directly assessing the poten- 
tial effects of environmental change on eco- 
system processes, and can provide critical input 
to the development of better Earth-system 
models. The authors simulated the effects of 
a prolonged dry season by preventing rainfall 
from reaching the forest floor for three months, 
and measured greenhouse-gas emissions from 
soil before, during and after the experiment. 
After the simulated drought, Wood and 
Silver documented profound declines in the 
efflux of CO₂ from soil and increases in soil
consumption of methane. In addition, the 
drought elicited surprising reductions in 
denitrification (the production of nitrous 
oxide), a process that is carried out by a tiny 
fraction of the soil’s microbial community. 
Together, these effects lowered the global- 
warming potential of the soil emissions. When 
the heat-trapping ability of nitrous oxide is 
taken into account, a substantial reduction in 
total global-warming potential from all three 
gases was accounted for by the decline of 
denitrification. Because tropical forest soils are 
a major source of nitrous oxide emissions to 
the atmosphere’, this result implies that a drier 
climate could cause substantial reductions in 
global emissions of nitrous oxide from soil. 
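
The weighting described here can be sketched as a small calculation: each gas flux is multiplied by its global-warming potential and the products are summed to give a CO₂-equivalent total. The sketch below is illustrative only. The potentials are the 100-year values from the IPCC Fourth Assessment Report, which is an assumption made here rather than something stated in the article, and the flux numbers are hypothetical, not Wood and Silver's measurements.

```python
# Assumed 100-year global-warming potentials (IPCC AR4 values), per unit mass of gas.
GWP_100 = {"CO2": 1.0, "CH4": 25.0, "N2O": 298.0}


def co2_equivalent(fluxes):
    """Sum gas fluxes (g of gas per m^2 per year) weighted by their GWP."""
    return sum(GWP_100[gas] * flux for gas, flux in fluxes.items())


# Hypothetical fluxes; a negative methane flux means net consumption by the soil.
control = {"CO2": 4000.0, "CH4": -0.2, "N2O": 0.5}
drought = {"CO2": 3000.0, "CH4": -0.4, "N2O": 0.2}

# Total reduction in CO2-equivalent emissions under the simulated drought.
reduction = co2_equivalent(control) - co2_equivalent(drought)

# Fraction of that reduction contributed by the decline in nitrous oxide alone.
n2o_share = GWP_100["N2O"] * (control["N2O"] - drought["N2O"]) / reduction

print(f"reduction: {reduction:.1f} g CO2-eq per m^2 per year")
print(f"share from N2O: {n2o_share:.1%}")
```

Because nitrous oxide carries a weight of roughly 300 in this scheme, even a small absolute change in its flux can account for a noticeable share of the total.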
However, extrapolating results from experi- 
ments such as these is particularly challenging 
for tropical forests. In Wood and Silver’s 
experiment, drought effects varied depending 
on landscape position: ridge and valley plots 
had larger responses to experimental drought 
than did those on slopes. This variation has 
considerable implications for models used to 
simulate biogeochemical responses to climate 
change, none of which is able to capture the
complex interplay of factors that can create
substantial biogeochemical heterogeneity
within and across tropical forests’. The
authors’ results are a prime example of the
challenge facing scientists: many tropical
regions have highly complex landscapes with
varying nutrient availability, which in turn can
regulate biological processes that influence
greenhouse-gas production*”.

Figure 1 | Luquillo Experimental Forest. Wood and Silver’ have studied the effect of drought on
greenhouse gases emitted from soils in the Luquillo Experimental Forest, Puerto Rico.

The possibility of positive feedback between 
climate change and greenhouse-gas emissions 
from soil has been recognized for decades. 
Wood and Silver's results suggest the opposite: 
declines in greenhouse-gas emissions follow- 
ing drought would reduce climate forcing. But 
as the authors highlight, the handful of studies 
in which rainfall has been excluded from tropi- 
cal forests have shown positive, negative or no 
net effects of drought on soil greenhouse-gas 
emissions. Moreover, although it is crucial to 
consider the effects of drought alone, real- 
world emissions will hinge on the combined 
effects of changing precipitation and tempera- 
ture, along with chronic shifts in atmospheric 
CO₂ levels and nutrient deposition, factors
that were not manipulated in the authors’ 
experiment. Finally, studies'®'' have shown 
that the growth and carbon uptake of trees 
in the tropics are highly sensitive to climate, 
but the experimental plots used in Wood and 
Silver’s study (1.54 square metres) were not 
large enough to simulate the potentially 
negative effects of drought on carbon uptake 
through tree growth. The overall effects of 
drought on the greenhouse-gas balance of the 
sites therefore remain unknown. 

In recent decades, tropical forests have given 
us a discount on anthropogenic CO₂ emissions
by absorbing more greenhouse gases
than human activity produces. Wood and Sil- 
ver's findings suggest that tropical soils may 
continue to offset greenhouse-gas emis- 
sions during drought. Perhaps more notably, 
their study highlights the need for additional 
large-scale experiments that can more com- 
pletely resolve the potential effects of climate 
change on trace gas emissions in tropical forests. 
The authors’ data are critical for improving and 
validating models that predict ecosystem and 
climate responses over large spatial and tem- 
poral scales. Considering the pivotal role of 
tropical ecosystems in basic human health and 
welfare, our understanding of fundamental eco- 
system processes, and their potential response 
to climate change, remains woefully incomplete 
in complex and diverse tropical forests.

Poked to vote


A Facebook message sent out during the 2010 US congressional elections influenced 
the voting behaviour of millions of people. The experiment illustrates the power of 
digital social networks to spread behavioural change. SEE LETTER P.295 


SINAN ARAL 


Social networks are the pathways through
which information, advice, resources and
support flow between people. They are
essential for many people’s decision-making, 
cooperation and complex interdependence. 
Yet although humans have almost always 
lived in networks, advances in computing 
power and new social technologies have only 
recently facilitated the development of forms 
of networked communication that are auto- 
mating and accelerating the social signals 
that pulse through the human network on a 
daily basis. The rapid dissemination of social 
signals in these digital networks — status 
updates, tweets, likes, posts, shares and so 
on — raises serious scientific questions: how, 
when and to what extent do these signals 
influence decision-making and the spread of 
behaviours in society? If social influence drives 
behaviour, then digital social signals could be 
used to promote widespread behaviour change 
and thus to transform commerce, politics and 
public health. On page 295 of this issue, Bond
et al.' present some of the most convincing evi- 
dence to date that peer influence and digital 
social signals can affect political mobilization. 

Political mobilization has been central to 
recent discourse about the transformative 
effects of social media — for example, the part 
that technologies such as Facebook or Twitter 
played in the protests collectively known as the 
Arab Spring, or may play in the forthcoming 
US presidential election. The question is: what 
role do peer influence and digital social signals 
have in mobilizing political expression? Do our 
friends’ behaviours inspire us to be politically 
active, to protest or to vote? 

These questions may seem relatively simple 
to answer, given the right data. But several sta- 
tistical challenges make it difficult to quanti- 
tatively estimate peer influence in networks. 
For example, networks are homophilous — we 
tend to make friends with people like ourselves 
and thus have preferences that are highly cor- 
related with those of our friends’. If two friends 
adopt a behaviour, one immediately after the 
other, it is difficult to distinguish whether this
is due to peer influence or homophily. Network
peers are also likely to be exposed to similar
environmental stimuli: we work together and
thus receive the same employee incentives; we
live together and so are exposed to the same
neighbourhood events; and because we have
the same preferences, we tend to browse the
same websites and watch the same television
shows, so we see the same advertisements.
This correlated exposure to external stimuli
could also contribute to correlations in peer
behaviours over time.

Methods are now available that can sepa-
rate peer influence from homophily and other
confounding factors in observational data’.
This is useful because most data on these
questions are observational. But controlling
for unobservable factors, such as latent homo-
phily, remains difficult*. As an alternative to
observational data, experimental network
studies using random assignment can provide
a more robust means of identifying causal
peer effects in networks and distinguishing
between influence and confounding factors.
Such experiments have demonstrated a role for
peer influence in product adoption”’ (Box 1),
health behaviours® and altruism’.

However, although it has been postulated
that voting is also motivated by interactions
with friends, neighbours and family mem-
bers’®, most research and many ‘get out the
vote’ mobilization campaigns have focused on
the individual’ (for an exception, see ref. 13
for experimental evidence of peer influence on
voting behaviour in two-person households).
By contrast, Bond and colleagues used a ran-
domized experiment, involving 61 million
people within the intricate social network of
Facebook, to show that digital social signals
directly influence political self-expression,
information-seeking and real-world voting
behaviour.

The experiment was conducted during the
2010 US congressional elections. The targeted
Facebook users saw a statement at the top of
their ‘News Feed’ that encouraged them to
vote, provided a link to information on local
polling places, contained a clickable ‘I voted’
button and showed a counter of Facebook
users who had voted (see Fig. 1 of the paper’).
This was the ‘informational’ message, received
by a control group of users. Another group
received a ‘social’ message, which addition-
ally included six randomly selected profile
pictures of Facebook friends of the user who
had already clicked the ‘I voted’ button. The
researchers categorized these friends into
either ordinary friends or close friends on the
basis of the degree of Facebook interaction.

The results show that users who received
the social message were 2.08% more likely
to report that they themselves voted, 0.26%
more likely to seek information about a polling
location, and 0.39% more likely to actually
vote than users receiving the informational
message (the authors estimated actual voting
using data on 6.3 million users in their sam-
ple that could be matched to publicly available
voting records). The authors also examined
‘contagion’ effects of the messages on users who
themselves received neither the social nor the
informational message, but who had a friend
who received a message. Here, in the validated
voting sample, individuals were 0.224%
more likely to vote, for each close friend who
received a message, than they would have been
had their friend received no message.

BOX 1
Networking behaviour

The images shown display the uptake of
a personalized mobile-phone application
across a network of Yahoo! customers
communicating over the company’s instant-
messenger service®. Each panel displays a
snapshot of one cluster of the network at
33 days (a), 41 days (b) and 62 days (c)
after the application’s launch. The four large
black nodes represent customers who were
initial adopters of the application; yellow
nodes are new adopters in each period; red
nodes are those who adopted in previous
periods; and blue nodes are non-adopters.
Orange lines highlight connections between
previous adopters and new adopters that are
activated as the product diffuses through the
population.

A key question is how much the spread
of behaviours such as product adoption
is due to social influence, or to homophily
(correlated preferences among friends)
and/or other confounding factors.
Such confounding factors mimic the
influence that the spread of behaviours
in social networks has on the resulting
diffusion patterns. Separating that influence
from homophily and other factors, and
establishing estimates of the degree to
which peer-induced action is responsible for
the adoption of a behaviour, are essential
for creating effective policies that promote
positive behaviours in society and contain
negative ones. S.A.

Although these estimates may seem small, 
they translate into significant numbers of 
votes. A social message saying that a Facebook 
friend had voted generated 886,000 additional 
‘expressed’ votes (clicks on the ‘I voted’ button), 
and messages involving a close friend gener- 
ated an additional 559,000 expressed votes. In 
terms of real-world voting, the authors were 
able to validate 282,000 additional votes cast 
by people receiving a message that a close 
Facebook friend had voted. 
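
How a small per-person effect becomes a large absolute number can be sketched with the basic estimator for a randomized experiment: the difference between the response rates of the treated and control groups, with a standard error for two independent proportions. This is a minimal illustration, not the analysis used by Bond and colleagues. The function name, the counts and the population size below are all hypothetical.

```python
import math


def difference_in_proportions(treated_yes, treated_n, control_yes, control_n):
    """Return (difference, standard_error) for two independent groups."""
    p_t = treated_yes / treated_n
    p_c = control_yes / control_n
    diff = p_t - p_c
    # Standard error of a difference between two independent proportions.
    se = math.sqrt(p_t * (1 - p_t) / treated_n + p_c * (1 - p_c) / control_n)
    return diff, se


# Hypothetical counts (not the study's data): 20.4% respond in the treated
# group against 20.0% in the control group, with 100,000 users in each.
diff, se = difference_in_proportions(20_400, 100_000, 20_000, 100_000)

# Scaling the per-person effect to a hypothetical population of 60 million.
extra_actions = diff * 60_000_000

print(f"effect: {diff:.4f} (standard error {se:.4f})")
print(f"extra actions at scale: {extra_actions:,.0f}")
```

Random assignment is what licenses reading the difference as causal: homophily and shared environment affect both groups equally on average, so they cancel out of the comparison.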

This work has important implications for 
our understanding of how signals in social 
networks influence an individual's behaviour. 
But there is more work to be done. In par- 
ticular, future work modelling the degree to 
which contamination or leakage in networked 
experiments’ (also called interference") affects 
inference, examination of how peer influence
varies across behaviours’, and evaluation of the 
social and structural conditions under which 
influence is more or less likely to propagate 
will all be essential to our understanding of the 
spread of behaviour change through human 
populations. 

Advancing our understanding of peer 
influence in networks is the first step towards 
designing ‘network interventions’ that can 
promote positive behaviours in human popu- 
lations, or contain negative ones’. It is perhaps 
obvious that this is relevant to, for example, 
targeted advertising and viral marketing. But 
such interventions also have the potential 
to promote positive social changes, such as 
increasing the rate of HIV testing, reducing
violence, improving adherence to exercise, or
increasing political mobilization and aware-
ness. In this way, the science of social influence
may have dramatic implications for products,
politics and public health.


ORGANIC SYNTHESIS

A biochemical
messenger made easily


Biochemicals known as prostaglandins are challenging targets for synthetic 
organic chemistry. Yet by channelling the reactivity of a simple reactant, a 
powerful synthesis of one such compound has been achieved. SEE LETTER P.278 


ERIK J. SORENSEN 


Chemical messengers called prostaglandins
are present in nearly all mammalian tissues.
These elusive molecules mediate an
extraordinary number of
biological processes — including the regula- 
tion of body temperature, the contraction and 
relaxation of the human uterus, the aggregation 
of platelets in blood and cellular responses to 
inflammation. They have therefore been the tar- 
gets of wide-ranging research’ since the 1930s. 
In particular, their unique molecular archi- 
tectures and great therapeutic potential have 
fired the creative imagination of synthetic 
organic chemists**. Writing on page 278 
of this issue, Coulthard et al.* report one of 
the cleverest syntheses of one such molecule, 
prostaglandin F₂α (PGF₂α), to date. The con-
ciseness of their approach may open up new
opportunities for drug discovery.

Prostaglandins contain two fat-soluble
hydrocarbon chains on opposite sides of a ring
of five carbon atoms (Fig. 1). Nature synthesizes
PGF₂α and its analogues from arachidonic acid
(a polyunsaturated fatty acid) and two mol-
ecules of oxygen, with a little help from some
key enzymes. Although nearly every nucleated
cell is capable of biosynthesizing prostaglan-
dins, these compounds are short-lived and
exceedingly difficult to isolate from biological
samples. Since the late 1960s, chemists have
therefore devised creative strategies for pro-
ducing prostaglandins from simple chemicals
in the laboratory. The development of depend-
able laboratory syntheses has contributed sub-
stantially to our knowledge of the compounds’
remarkable range of pharmacological proper-
ties, as well as their relevance to human health.

Figure 1 | Landmark syntheses of prostaglandin F₂α (PGF₂α). The PGF₂α biochemical has long
acted as a testing ground for organic synthesis. The dates of selected syntheses” ’ are indicated, along
with the molecule that acted as the starting point or key intermediate for each synthesis. Coulthard
et al.’ now report a concise synthesis of PGF₂α that starts from succindialdehyde. Ac is acetyl, COCH₃;
Me is methyl, CH₃; Et is ethyl, C₂H₅; ᵗBu is tertiary butyl, (CH₃)₃C.

Ideas about the preparation of PGF₂α have resulted in a
diversity of pathways, a diversity that demonstrates how
well the principles of organic chemistry can be used to
plan and execute synthetic routes to structurally complex
molecules (Fig. 1). The approach now reported by Coulthard
et al. will be praised for its remarkable brevity and for
the bold strategy that guided it. The structural
relationship between the simple starting material,
succindialdehyde, and the coveted target is distant, and
yet the authors perceived that a union of two molecules of
succindialdehyde in a single laboratory operation might
produce an intermediate hemiacetal compound that is
tantalizingly close to the structure of PGF₂α (Fig. 2).

The risky aspect of the authors’ strategy 
concerns the intrinsic reactivity of succin- 
dialdehyde and its potential to take part in 
undesired, polymer-forming reactions. Few 
chemists, even those of adventurous spirit, 
would have believed that a useful synthesis of 
the hemiacetal could be achieved as Coulthard 
et al. anticipated, through the direct pairing of 
two molecules of succindialdehyde in sequen- 
tial carbon-carbon-bond-forming reactions 
known as aldol reactions. The great thing is 
that, not only did this direct approach work 
well, but it also achieved high stereoselectivity 
— it produced mostly one mirror-image isomer 
(enantiomer) of the product. This is crucial, 
because the biological activity of prostaglandins 
depends on their enantiomeric form. 

Figure 2 | A concise synthesis of PGF₂α. a, In Coulthard
and colleagues’ synthesis, two equivalents of
succindialdehyde react with each other to produce a
hemiacetal when treated with catalytic L-proline, and then
with another catalyst (an amine salt, [Bn₂NH₂]⁺[OCOCF₃]⁻),
in the same reaction vessel. The product is obtained in
20% yield as a 99:1 ratio of enantiomers (mirror-image
isomers); the major isomer is depicted. Oligomeric side
products are also formed, but are easily removed by
filtration. Bn is benzyl, CH₂C₆H₅. b, The hemiacetal is
treated with methanol (CH₃OH), an acid (Amberlyst-15) and
magnesium sulphate (MgSO₄) to yield an acetal, which
contains most of the features needed for the ring
component of PGF₂α. c, The authors completed the synthesis
by attaching two side chains to the acetal.

The authors discovered that the sequential addition of two
organic catalysts to a solution of succindialdehyde is key
to the formation of the desired hemiacetal. The first
catalyst was the amino acid proline, and the second was an
amine salt. The development of this process was supported
by earlier studies of proline-catalysed aldol reactions
and ring-forming aldol reactions induced by amine salts at
room temperature.

Although the yield of hemiacetal isolated from the
reaction was moderate (20%), Coulthard et al. removed most
of the undesired oligomeric side products by filtration. A
simple chromatographic purification step then afforded the
product as the major component of a 99:1 mixture of
enantiomers. Any possible concerns about the yield for
this impressive transformation are assuaged by the
authors’ demonstration that 15 grams of a key
prostaglandin building block (an acetal; Fig. 2) can be
produced in only two steps from succindialdehyde, a large
quantity to take forward for the rest of the synthesis.

With its ring of five carbon atoms and newly formed
stereogenic centres (carbon atoms around which different
geometric arrangements of bonds generate different
enantiomers), the acetal building block is a close
approximation of the core structure of the prostaglandins.
By exploiting the reactive alkene group and the rigid
molecular shape of the building block, Coulthard et al.
attached one of the two hydrocarbon side chains of PGF₂α
to the desired site on the five-membered ring. Then, in
the final phase of the synthesis, the authors efficiently
connected the second side chain using a reaction developed
by E. J. Corey in his influential syntheses of
prostaglandins. In all, only seven operations were needed
to transmute the simple compound succindialdehyde into
PGF₂α. For comparison, Corey’s synthesis required 17
steps, starting from the simple molecule cyclopentadiene.

Finding a way to prepare prostaglandins and their
analogues in a few steps from commonplace chemicals is
still highly alluring for many researchers, especially
those in the pharmaceutical industry. The brevity of
Coulthard and colleagues’ synthesis of PGF₂α sets a new
standard, and their approach should be applicable to other
members of the prostaglandin class, as well as to new
structures that have some of the molecular features of
prostaglandins. The authors’ achievement also calls to
mind a much earlier groundbreaking use of succindialdehyde
in chemical synthesis: Robert Robinson’s synthesis of
tropinone, a precursor to the drug atropine, in 1917. Some
of the power and simplicity of that classic work is seen
in Coulthard and colleagues’ remarkable solution to the
problem of building prostaglandins.

NATURE.COM: For more on prostaglandin synthesis, see
go.nature.com/thqz85
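
The 99:1 enantiomer ratio quoted for the hemiacetal can also be stated as an enantiomeric excess, the figure chemists more often report. The short Python sketch below is an illustration only and is not taken from Coulthard and colleagues' paper; the function name `enantiomeric_excess` is a hypothetical helper, and it uses the standard definition (major minus minor, divided by their sum).

```python
def enantiomeric_excess(major: float, minor: float) -> float:
    """Return the enantiomeric excess (in %) for a major:minor enantiomer ratio.

    Hypothetical helper for illustration; uses the standard definition
    ee = 100 * (major - minor) / (major + minor).
    """
    total = major + minor
    if total <= 0:
        raise ValueError("ratio parts must sum to a positive number")
    return 100.0 * (major - minor) / total


if __name__ == "__main__":
    # The 99:1 ratio reported for the hemiacetal product
    print(f"ee = {enantiomeric_excess(99, 1):.0f}%")
```

On this definition, a 99:1 mixture corresponds to an enantiomeric excess of 98%, whereas an equal (50:50) mixture has none.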
Dormant and restless skin stem cells


It has been unclear whether a uniform group of stem cells gives rise to most 
cells in the epidermis. A study reveals the presence of at least two stem-cell 
populations that have different proliferative abilities. SEE ARTICLE P.257 


LAURA DE ROSA & MICHELE DE LUCA 


All renewing tissues, such as blood and skin, are
sustained and repaired by a small population of resident
stem cells. These cells have the ability to self-renew and
to generate committed progenitor cells that differentiate
into the cell lineages of the tissue of origin. However,
the nature and the specific activities of stem and
progenitor cells at different body sites are a matter of
debate. For example, several theories have been proposed
regarding the epidermis, the outer covering of the skin
(Fig. 1). On page 257 of this issue, Mascré et al. provide
compelling evidence for the existence of two cell
populations that differ in their proliferative dynamics,
their gene-expression profile and their ability to repair
the epidermis after injury.

To investigate the origin, location, proliferation and
fate of stem cells in mouse-tail epidermis, the authors
used genetic lineage tracing (a technique that allows in
vivo fluorescent marking of specific cell types and their
progeny) as well as elegant studies of individual clones
(cells derived from a single initial cell). The
specificity of the marking depends on the nature of the
promoters (regulatory DNA sequences) that are employed to
trigger permanent expression of a fluorescent protein.
This is useful because different promoters are active in
different cell types.

Figure 1 | Models for self-renewal of the epidermis. a, One
model of epidermal renewal relies on quiescent stem cells
that generate transient actively dividing (amplifying)
cells, which in turn generate non-dividing, differentiated
cells. Curly arrows indicate self-renewing cells. b, A
second model posits the existence of a single population
of actively cycling, committed progenitor cells that
directly generate differentiated cells. c, Mascré and
colleagues’ results suggest that epidermal renewal is
sustained by slowly dividing stem cells that persist
during the lifetime of the animal and contribute to both
repair and long-term regeneration of the epidermis. These
cells give rise to actively cycling, self-renewing stem
cells (or committed progenitors) that persist for various
periods of time, replenish differentiated cells and make
only a short-term contribution to wound healing. d, We
suggest that slowly dividing stem cells generate activated
stem cells that give rise to transient amplifying
progenitors, which eventually produce differentiated
cells. A similar model has previously been proposed for
other tissues.

Mascré et al. used two distinct promoters 
(K14 and Inv) and unambiguously identified 
two cell populations. One of these populations 
consisted of slowly dividing cells, which repre- 
sented the stem-cell reservoir and contributed 
to wound repair. These stem cells gave rise to 
a second population that comprised actively 
dividing committed progenitors, which even- 
tually generated differentiated cells. The K14 
promoter targeted both stem cells and com- 
mitted progenitors, whereas the Inv promoter 
targeted only progenitors. 

The results confirm the notion that the epidermis contains
long-lived quiescent stem cells as well as cells endowed
with differing proliferative abilities, and argue against
the model proposing that a single progenitor-cell pool
sustains self-renewal and regeneration of the mouse
epidermis (Fig. 1a–c). Moreover, the authors’ study
provides an explanation for previous findings that seemed
to suggest a single-progenitor model. These earlier
studies had used a promoter (Ach) that targets just one
group of actively cycling cells that make only a
short-term contribution to epidermal repair; in other
words, the Ach-marked cells behaved in a similar way to
Mascré and colleagues’ Inv-targeted progenitor cells.

Interestingly, the two cell populations 
described by the authors both possess a fun- 
damental stem-cell property, namely, self- 
renewal. However, the slowly cycling stem cells 
persist throughout the animal’s lifetime and 
contribute to the repair and long-term regen- 
eration of the epidermis, whereas the actively 
dividing progenitors endure for shorter peri- 
ods (up to several months) and make only a 
short-term contribution to wound healing. 
Therefore, these actively dividing progenitors 
are not the canonical ‘transient amplifying 
cells’ proposed by a previous model (Fig. 1a). 
How can we define the K14- and Inv- 
labelled cells from a functional point of view? 
To try to answer this question, it may be use- 
ful to discuss what happens in other tissues. 
It has been shown that the bone marrow contains a pool of
quiescent haematopoietic (blood-forming) stem cells that
generate a population of rapidly cycling stem cells.
K14-marked stem cells are functionally similar to
quiescent haematopoietic stem cells, whereas Inv-marked
progenitors mirror actively cycling, self-renewing
haematopoietic stem cells. A similar situation has been
described for tissues such as hair follicles and the small
intestine. In all of these tissues, stem cells can switch
between quiescence and activation depending on the
functional status of the tissue; that is, normal
self-renewal or acute wound healing. In our opinion,
Mascré and colleagues’ data support a model envisaging the
coexistence of quiescent and activated stem cells as a
general strategy for tissue renewal (Fig. 1c).

The notion that the mammalian epidermis 
contains cells endowed with different prolif- 
erative dynamics raises interesting questions 
that need further investigation. What factors 
specify the fate of these two cell populations? 
What are the molecular pathways that regu- 
late stem-cell quiescence and activation in 
the epidermis, especially upon injury? And 
is the switch from quiescence to activation a 
cell-autonomous process, or is it controlled by 
environmental cues? 

The study also highlights the limitations 
of genetic lineage tracing. Results obtained 
with this technique are influenced by the 
dynamics of the stem-cell populations in the 
tissues analysed, as well as by the types of 
promoter used and by how the promoters are 
activated. For instance, the authors did not
detect transient amplifying progenitors, but it
is not clear whether such cells are genuinely
absent from the mouse epidermis or whether
they were not detected owing to the lack
of an appropriate promoter (Fig. 1d).

Mascré and colleagues’ work sheds light on 
the heterogeneity of the proliferative cell popu- 
lations in mouse epidermis, and has increased 
our understanding of stem-cell biology. But 
does this knowledge apply to human tissues? 
Caution is required when inferring aspects 
of human tissue physiology from animal 
data. For instance, clonal analyses of several 
human squamous epithelia (tissues such as the 
epidermis, cornea and conjunctiva) have 
unambiguously shown the existence of 
self-renewing cells endowed with stem-cell 
properties, as well as non-self-renewing cells 
with differing capacities for multiplication,
including canonical transient amplifying
cells. Both cell types participate in the
regeneration of these epithelia in the clinic.
The rainforest’s water pump


An investigation of naturally occurring water recycling in rainforests finally 
marries the results of global climate models with observations. Alarmingly, it also 
suggests that deforestation can greatly reduce tropical rainfall. SEE LETTER P.282 


LUIZ E. O. C. ARAGÃO


The humid tropics contain more than 35% of global forests,
covering an area of 11,564,000 square kilometres (ref. 1).
Tropical trees can extract deep soil water and pump it
back to the atmosphere through a process called
evapotranspiration. As a result, 25–56% of the rainfall in
these regions can be recycled within the ecosystem.
Large-scale climate models indicate that this mechanism,
which maintains atmospheric moisture and so feeds
far-inland regions with rainfall, may be weakened by the
removal of existing tropical forests. Yet, paradoxically,
an increase in local rainfall over deforested areas has
been observed.

On page 282 of this issue, Spracklen et al. report an
analysis of tropical rainfall that combines models of
atmospheric transport with satellite observations of
rainfall and vegetation cover*. The authors conclude that,
for more than 60% of pan-tropical land (that is, the
tropics across all continents), air masses that have
travelled over extensive vegetated surfaces can generate
at least twice as much rainfall as air masses that have
flowed over deforested lands. On the basis of their
findings, the authors predict a potentially widespread
reduction of rainfall in the Amazon basin, if
deforestation in the region were to continue at the rate
that occurred from 1997 to 2002.

*This article and the paper under discussion were
published online on 5 September 2012.

Sustaining high rates of global human- 
population growth requires an increase in 
farmable land, energy and timber supplies. 
Without adequate planning, these needs 
tend to be met by cutting down rainforests. 
Because ecological processes and human 
activities (such as agriculture) depend on 
water provision, scientists have, for at least 
two decades, investigated the impact of tropi- 
cal deforestation on the hydrological cycle, 
especially in Amazonia. 

Pioneering studies using global climate
models (GCMs) found that vegetation removal
can disrupt the water cycle. For example,
models show that changes in land surface
characteristics cause a reduction in
evapotranspiration rates and so induce basin-wide
decreases in Amazonian rainfall. Another
simulation shows that removing around
40% of the original forest cover could
drive Amazonia into an irreversible, drier
climate mode.

These findings, however, are at odds with 
observations. Studies indicate that rainfall over
deforested areas is, in fact, higher than in adjacent
forests. This is because increased surface
heating in deforested areas induces upward 
air motion, reducing air pressure and drawing 
moist air from neighbouring forests into the 
openings. The moist air rises, and so generates 
Now is an exciting time to study the microbes
that are associated with the body (the human
microbiota). Although the beneficial effects of
some of these microbes have been documented for more 
than a century, only with recent technological advances can 
the composition of the diverse microbial communities be 
described and compared within and between individuals. 
Two major funding initiatives — the US Human 
Microbiome Project and the European Metagenomics of 
the Human Intestinal Tract — are well under way, and with 
the contributions of individual pioneer research groups, 
we are beginning to understand what constitutes a healthy 
microbiota and how changes in its structure correlate 
with and affect health and disease. The more we learn 
about these microbes, the more we realize that they are 
influencing many aspects of our physiology. 

This Insight focuses on the gut — the body site that 
is most densely populated with microbes. The greatest 
challenge in microbiota research is the number and 
diversity of species in the intestine. Rob Knight and 
colleagues provide a view on intestinal microbiota using 
principles derived from macroecology, and discuss how 
these principles can help to unravel the factors that shape 
microbial communities and may offer ways to target the 
microbiota for clinical intervention. Casey Weaver and 
colleagues review the interplay between the microbiota 
and the immune system by discussing the mechanisms 
that maintain a healthy balance, as well as the pathological 
consequences following perturbation. One of the main 
functions of gut microbes is to aid nutrient acquisition 
from otherwise indigestible fibres. Valentina Tremaroli 
and Fredrik Bäckhed explore the interactions between
microbes and metabolic processes, and describe how 
these exchanges can lead to either a healthy equilibrium or 
metabolic disease. Much of the research would not have 
been possible without advances in sequencing technology 
and analysis of the microbial community genome — the 
microbiome. George Weinstock reviews the genomic tools 
that researchers can use to analyse the microbiota. 

We hope that these Reviews capture some of the most 
exciting insights into microbiota research. We are pleased 
to acknowledge the financial support of Yakult Honsha 
in producing this Insight. As always, Nature carries sole 
responsibility for all editorial content. 


Claudia Lupp, Magdalena Skipper & Ursula Weiss 
Senior Editors 
CAREERS


COLUMN


Working together


The right administrative system can ease the job search 
for dual-career couples, argues Mary Anne Holmes. 


It is a process common on university campuses around the
world. A faculty search committee ploughs through piles of
applications, sorts them, evaluates them, brings the best
candidates to campus and interviews the top few. It
eventually arrives at the perfect person for the job. The
head of the department makes the offer, only to discover
that the perfect candidate has a partner, and the dreaded
two-body problem must be addressed. It can be a nightmare:
the whole search collapsing as the clock runs out and the
committee tries to find a position for the partner. But at
the University of Nebraska-Lincoln (UNL), we have
developed a means of dealing with the problem to help both
candidate and recruiter.

By the time a job offer is made, a substantial amount of
time and money has been invested. At the UNL, departments
spend between US$9,000 and $18,000 simply advertising a
position. Factor in the time spent by faculty members on
reviewing applications (10–20 hours, depending on the size
of the pile) and the costs are even higher. No one wants a
candidate to turn down an offer.

The joint hiring of academic couples has 
quadrupled since the 1970s. By 2008, 36% of 
academics in the United States had a partner 
who also worked in academia. And the rate was 
highest among scientists: 54% of male and 83% 
of female scientists had a partner in academic 
science, most in a similar field (L. Schiebinger
et al. Dual-Career Academic Couples: What 
Universities Need to Know; Stanford University, 
2008). With so many female scientists part- 
nered with an academic, strategies to increase 
the number of women employed as faculty 
members in science, technology, engineering 
and mathematics should include a means of 
addressing dual-career opportunities. 

The UNL tackled this question using an 
Institutional Transformation award from the 
US National Science Foundation’s ADVANCE 
programme, which aims to improve the rep- 
resentation of women in academic scientific 
careers. The university came up with a process 
that should transfer well to other institutions. 


FROM THE START 

The UNL Dual Career Assistance initiative 
addresses the two-body problem early in the 
negotiation and recruiting process. Primary 
candidates are informed about the programme 
once they have been shortlisted, but before they 
are interviewed. They are told that if they have 
a partner who is qualified for a faculty position, 
the partner should send a letter of interest, a 
CV and a statement of research and teaching 
interests to the ADVANCE office. The direc- 
tor of the ADVANCE programme notifies the 
appropriate department chair and the college 
dean of the potential dual-career hire. 

This initiative gives the candidate a safe place 
to disclose their relationship status without 
making it a factor in deliberations, and pro- 
vides a relay office to alert all parties. The extra 
time allows the university to bring the partner 
to campus for an interview as soon as the offer 
for the primary candidate is put together. The 
original search committee is left out of the flow 
of information, so that it doesn't inappropriately
weigh relationship status as a criterion for selec- 
tion. After all, it may be hard to resist the urge to 
speculate on how having a partner will affect a 
candidate's likelihood of accepting an offer. But 
relationship status, of course, has little to do with 
the candidate's suitability for the position. 

If a candidate with an academic partner is
ultimately selected for a faculty position
and the partner's target college and depart- 
ment are interested, the partner is flown to 
Lincoln for an interview. If the interview 
goes well and faculty members in the tar- 
get department vote to make an offer to 
the partner, two offers go forward. If the 
department is not interested, the process 
stops and the primary candidate and their 
partner have to decide whether to come to 
the university with just one job (we have 
lost a few candidates this way). 

The ADVANCE grant supplies one- 
quarter of the salary for the partner for up 
to three years, with the department, college 
and provost’s office each supplying a further 
quarter. The university's office of research 
provides start-up funds. This three- 
year bridge funding buys time for all the 
administrators to find a permanent fund- 
ing stream for the partner, which might 
be provided by a new funding line or the 
retirement of an existing faculty member. 

Sometimes, the primary candidate has 
a partner who does not want to stay in 
academia. We can assist with external job 
searches through our networks and the 
local chamber of commerce, or the partner 
can look for a job independently. 

The UNL’s original ADVANCE grant
was for five years, and we anticipated that 
as many as eight dual-career hires would be 
made over that time. In fact, after four years 
we have hired 12 partners. 

What happens when the ADVANCE 
funding runs out? Bridge funding could 
help. But there are ways, in principle, for 
universities to institutionalize this process 
without any grants. One model could be 
to reserve a proportion of new positions 
for dual-career hires. The provost or dean 
could refrain from filling a few positions 
each year, or could raise an endowment. 

Overall, the UNL has had more success 
with these formal procedures than with 
informal last-minute requests to accom- 
modate a partner. The programme works in 
large part because department chairs agree 
to cooperate: if we accept a partner now, the 
reasoning goes, another department will 
help us in the future. Some administrators 
have expressed concern that a formal dual- 
career programme will raise the expecta- 
tions of partners, only for these to be dashed 
when no position is forthcoming. However, 
we find that prospective faculty members 
appreciate the transparency of the process. 

Dual-career partnership is not a passing 
fad. Addressing this need must be an integral 
part of future universities if we are to attract 
and keep the best and the brightest.


Mary Anne Holmes is a professor of Earth 
and atmospheric sciences at the University 
of Nebraska-Lincoln and director of 
ADVANCE-Nebraska. 

Alberto Gonzalez Fairén


As a teenager in Spain, Alberto Gonzalez 
Fairén watched US astronomer Carl Sagan's 
television series Cosmos, read the book based 
on the series and knew that he wanted to be
a scientist. He is now an astrobiologist at the
SETI Institute’s Carl Sagan Center for the Study 
of Life in the Universe in Mountain View, 
California, and the NASA Ames Research 
Center in Moffett Field, California. In October, 
Fairén will collect the Harold C. Urey Prize for 
outstanding achievement in planetary science 
from the American Astronomical Society, 
based in Washington DC. 


What was it about Cosmos that sparked your 
interest in science? 

Listening to Sagan telling stories about how 
science is done and how scientists work, and 
describing the thrill of scientific discovery, I 
said to myself, “That’s what I want to do.”


You did a master’s degree in genetics. How did 
you move from that to Martian environments? 

It was the other way around. I’ve been very 
interested in planetary science and astro- 
biology since I was a teenager, but there were 
no university programmes in the field in 
Spain at the time. Genetics was a distant sec- 
ond interest to me, but I needed to graduate to 
start my PhD and begin working as a planetary
scientist. Astrobiology was starting to become 
respectable, and I thought that a background 
in genetics would be appropriate for an astro- 
biologist interested in Martian environments. 


What are your plans for the future? 

I’m leaving the SETI Institute because ‘soft
money’ (working solely off grants) is no
longer an option for me. When you have a family,
you need some kind of stability. I’ve been
offered two academic positions in the United
States, and I’ve won a starting grant from the
European Research Council. If I accept one of
the offers here, I will stay in US Mars science, 
which is very attractive right now, especially 
after the success with the Curiosity rover land- 
ing. If I go back to Europe, I will have substan- 
tial funding to launch my own group, which 
seems the logical next step. 


Describe your biggest career milestone. 

Carbonates are minerals produced in water; 
the fact that none had been found on Mars had 
long been used to argue that there were never 
oceans on Mars. But my adviser told me about 
a river in Spain that was very acidic; I knew 
that carbonates do not form in acidic water. 
I was two years into my PhD and proposed that 
oceans on Mars were also acidic and that this
acidity inhibited the formation of carbonates.
A team including me and my adviser submitted 
a paper on the subject to Nature on 7 January 
2004. NASA's rover Opportunity landed on 
Mars days later — and found direct evidence 
for such acidic environments. We had accu- 
rately anticipated the rover’s results. Our paper 
was published on 23 September 2004, and was 
central to the completion of my PhD. 


Does the Urey prize have a special meaning 
for you? 

It won't help me to get grants or positions, but 
it is a high personal honour. There is a long list 
of names of very, very, very important people 
in planetary science who have won this award. 


Has it been difficult to adjust to living and 
working in the United States? 

Not really. Life in California is not so different 
from life in Spain, and I already had a good 
command of English. The biggest difference is 
that the United States rewards effort and merit. 


Do you have any regrets about your career?

No. Some time ago I realized that my errors
were necessary to bring me to where I am now. 
I didn't begin to pursue my PhD until I was 30, 
and while I was away from science I worked as 
a bartender, a taxi driver, a film-rating classifier 
and a teacher, among other things. But I got my 
PhD in 2006 and started working for NASA, 
and that was a turning point in my career. It 
offered me the freedom and resources to pur- 
sue my own investigations. My career would 
have been totally different without the NASA 
experience. 


INTERVIEW BY KAREN KAPLAN 


SCIENCE FICTION


BY JOHN GILBEY 


We’d been dating for years, but we
were still tiptoeing around the
question that faces many couples
in our situation. It was, we agreed very
late one night, a matter of commitment and 
trust — and not something to take on lightly. 

In retrospect, it was 
probably me that blocked 
the discussion from going 
any further which — I real- 
ized far too late — may well 
have been why Helen was 
so enthusiastic about the 
Silicon Valley job. It was 
a wonderful opportunity: 
three months with a bio- 
data start-up company in 
Mountain View getting 
them up to speed on the 
new techniques that she 
had just published. 

Sure, we vowed to stay 
in touch, but as her work 
ramped up and the sched- 
ule began to bite, our con- 
versations all but petered 
out. Glumly, I began to 
understand just what a 
mistake I'd made. I had, I 
reasoned, completely blown it — so I was 
really surprised when she wanted to see me 
the first evening after she’d got back from her
trip. She had, she said, something to tell me. 

We agreed to meet for dinner at the Tan- 
doori place near the railway station — where 
we’d had our first date. I got there early and
sat nervously scrunching poppadoms and 
drinking too much cold Indian lager while 
mentally preparing my speech of apology. 

She looked fantastically alive. The Califor- 
nian sun had tinted her hair slightly auburn 
and her skin had acquired a fresh glow of 
health and well-being. Helen smiled at me 
— the smile of someone who has important 
news that they are dying to share. I put two 
and two together and asked her if she was 
pregnant. Her laugh came as a relief to me, 
although the loud whoop I gave gained me 
some baleful stares from the other diners. So, 
if not that — then what? 

Helen turned slightly in her chair and 
scooped her hair aside with one hand. There, 
at the base of her skull, was the thing we had 
spent so long discussing: a small, dimpled 
disk of alloy, anodized a fashionable lime 
green, laser engraved with the tiny characters 


The ties that bind. 


defining her range of MAC addresses. Helen 
had got a Connection. 

I can't remember everything I said, but I 
know I used “betrayal” a few times — and 
“selfish”, and possibly a few worse epithets.
Helen sat serenely until I'd finished my rant, 
then she smiled away the waiters who were 
trying to come to her rescue. 


She looked at me with something that 
I hoped was compassion — but it was the 
kind of look you give to a grumpy child who
doesn’t understand the adult world. 

“I know we said we’d wait, but when I
got to the lab and saw how much synergy 
everyone got from being Connected, I just 
couldn't resist. They were really kind, but 
I could see that I wouldn't be able to work 
there effectively without it. It’s more than 
that though — it gives a wonderful sense 
of... well, belonging and understanding.”

I asked — testily — for an example, and 
Helen thought for a moment. “Well, things 
like knowing that the X32 bus is running nine 
minutes late, that the guy over there in the 
corner is really excited about his new media 
job — he is here to celebrate with his friends 
but they are still on the bus. He thinks I’m 
cute, by the way — he especially likes my... 
Uh, never mind. Oh, and it’s going to rain 
later, so I'd like to be home by ten ... There
is just so much richness and texture; I thought
you, of all people, would understand.”

Just because I’m a
software developer doesn't mean I have to
like the stuff, I thought gloomily. Having 
worked in the industry for years, I know 
just what happens when you are working 
against a wildly optimistic deadline with a 
microscopic budget — and that isn’t the sort 
of code I want plumbed permanently into 
my nervous system, even if it does mean I 
can read my e-mail while 
walking down the street. 
I considered remonstrating 
with her about the lack of 
adequate testing protocols, 
the myriad unknown risks 
of interactions and privacy
(don't get me started on
privacy), but they were all
old, stale arguments. 

Helen now had a golden 
world — a world without 
me — with the millions of 
other Connected folk who 
obviously shared startlingly 
intimate details of her life. 
Nothing I could do or say 
would change her delight 
in this new set of senses. I 
have to admit, though, that 
I was curious about what it 
felt like when the software 
was running, so I asked her. 

She wrinkled her nose, then her eyes 
defocused slightly while she reviewed her 
symbiotic systems and tried to put words to 
the sensations. “Kind of nice, like wearing 
a cosy pair of slippers — or walking with a 
friend. But the best bit is when an application 
gets upgraded, there’s a sort of tingly rush of 
anticipation like you get on Christmas Eve 
when you’re a kid.”

Her cheeks flushed slightly in a way that 
I recognized from a much more private 
moment. “In fact,” she said quietly, “the core
system is updating itself right now.”

I waited expectantly for a moment, and 
then slowly became aware of the lack of 
movement in the space around me. Across 
the table, Helen was rigid — with her pupils 
slowly dilating. As I watched with mount- 
ing horror, a small bead of saliva slid from 
one side of her mouth. Then the screaming 
started: sharp, guttural and inhuman — but 
it was a long time before I realized it was 
coming from me.


John Gilbey is currently wrestling with his 
first science-fiction novel, Contextual, and 
fears that it may be gaining the upper hand. 

Chronic lymphocytic leukaemia is driven by antigen-independent cell-autonomous signalling

Marcus Dühren-von Minden, Rudolf Übelhart, Dunja Schneider, Thomas Wossning, Martina P. Bach, Maike Buchner,
Daniel Hofmann, Elena Surova, Marie Follo, Fabian Köhler, Hedda Wardemann, Katja Zirlik, Hendrik Veelken
& Hassan Jumaa


B-cell antigen receptor (BCR) expression is an important feature of 
chronic lymphocytic leukaemia (CLL), one of the most prevalent 
B-cell neoplasias in Western countries¹. The presence of stereotyped
and quasi-identical BCRs in different CLL patients suggests that 
recognition of specific antigens might drive CLL pathogenesis. 
Here we show that, in contrast to other B-cell neoplasias, 
CLL-derived BCRs induce antigen-independent cell-autonomous 
signalling, which is dependent on the heavy-chain complementarity- 
determining region (HCDR3) and an internal epitope of the BCR. 
Indeed, transferring the HCDR3 of a CLL-derived BCR provides 
autonomous signalling capacity to a non-autonomously active 
BCR, whereas mutations in the internal epitope abolish this capacity. 
Because BCR expression was required for the binding of secreted 
CLL-derived BCRs to target cells, and mutations in the internal 
epitope reduced this binding, our results indicate a new model for 
CLL pathogenesis, with cell-autonomous antigen-independent 
signalling as a crucial pathogenic mechanism. 

CLL cases can be divided into two groups based on the mutational 
status of the immunoglobulin genes. Notably, in B cells from ~1% of 
both groups, unmutated CLL and mutated CLL cases express almost 
identical BCRs, and in 30% of cases the BCRs can be classified into 
certain stereotypes based on structural similarities and the HCDR3 (refs 
2-5). This restricted BCR repertoire has been interpreted to be the result 
of a BCR-driven selection mechanism initiated by specific antigens that 
may promote the expansion of the respective CLL clone⁶,⁷.

We have recently reported functional similarity between the pre-BCR
and polyreactive BCRs, and have demonstrated that both activate
cell-autonomous signalling in pre-B cells⁸. Furthermore, we have
shown that the pre-BCR induces ligand-independent cell-autonomous
signalling by surrogate light-chain-mediated binding to an intrinsic
pre-BCR glycosylation site⁹. CLL-derived BCRs are also often
polyreactive¹⁰,¹¹, suggesting that, in analogy to the pre-BCR, CLL
BCRs might activate ligand-independent autonomous signalling 
through the binding of BCR-intrinsic motifs. We tested this hypothesis 
by comparing the signalling capacity of randomly chosen BCRs from 
different B-cell lymphomas in triple knockout (TKO) cells that do not 
express endogenous pre-BCRs or BCRs owing to inactivation of RAG2 
and λ5 genes. An additional deficiency in SLP65 (also known as
BLNK) expression makes TKO cells an ideal reconstitution system 
to test receptor function by transfection of receptor components and 
a tamoxifen-inducible ERT2-SLP65 fusion protein¹². As shown previously⁸,
the pre-BCR and a polyreactive human BCR derived from
an early bone marrow B cell (BCR62) activate Ca²⁺ mobilization
autonomously without additional ligands, whereas a non-polyreactive
human BCR derived from a selected mature B cell (BCR53) requires
additional crosslinking (Supplementary Fig. 1a and data not shown).
Similar to BCR53, we found that BCRs derived from multiple myeloma
(n = 4), marginal zone lymphoma (n = 3), mantle cell lymphoma
(n = 4) and follicular lymphoma (n = 4) required crosslinking with
anti-BCR antibody in addition to SLP65 activation to induce Ca²⁺ flux
(Fig. 1, Supplementary Fig. 1b, c and Supplementary Table 1; data not
shown). By contrast, all tested (n = 17) CLL-derived BCRs, including
those containing typical CLL variable heavy chain (VH) elements (for
example, IGHV4-34, IGHV3-7, IGHV3-23, IGHV3-30 and IGHV1-69)²
and stereotyped HCDR3s, induced autonomous signalling on
SLP65 activation (Fig. 1, Supplementary Fig. 2a and Supplementary
Table 1; data not shown). No obvious difference was found between
mutated (n = 10) and unmutated (n = 7) CLL BCRs (Fig. 1 and data
not shown). Importantly, BCRs derived from healthy donors who
possess CLL-typical VH genes such as IGHV1-69 or IGHV4-34 did
not display autonomous signalling (Supplementary Fig. 2b).

Figure 1 | CLL-derived BCRs possess autonomous signalling capacity.
Fluorescence-activated cell sorting (FACS) analyses of Ca²⁺ flux after the
addition of 4-hydroxytamoxifen (4-OHT) to TKO cells expressing the ERT2-SLP65
fusion protein and four representative CLL-derived BCRs (left column),
two representative BCRs of multiple myeloma (MM), mantle cell lymphoma
(MCL), marginal zone lymphoma (MZL) and follicular lymphoma (FL)
(middle two columns) and BCR53 variants containing different HCDR3s of
representative CLL, multiple myeloma or mantle cell lymphoma receptors
(right column). Addition of the stimulus is indicated by black arrows. M,
mutated; UM, unmutated.

To investigate whether autonomous signalling is also involved in
lymphoproliferation in animal models of CLL, we examined TCL1
mice, which express a transgene for human T-cell leukaemia 1
(TCL1) under the control of the VH promoter and the Eμ enhancer¹³.
TCL1 is highly expressed in unmutated CLL¹⁴ and TCL1 transgenic
mice show clonal expansion of CD5⁺ IgM⁺ B cells bearing unmutated
BCRs¹³,¹⁵. To test whether transformed clones derived from TCL1 mice
express autonomously active BCRs, we studied four different BCRs
from TCL1 mice in the TKO reconstitution system. All four TCL1-derived
BCRs showed autonomous Ca²⁺ flux similar to the human
CLL BCRs and the murine pre-BCR, whereas a 4-hydroxy-3-iodo-5-nitrophenylacetyl
(NIP)-specific murine BCR (B1-8) showed no
autonomous signalling (Supplementary Fig. 3a, b). Similar to human
B-cell CLL, TCL1-transformed murine B cells are CD5⁺. Therefore,
we tested whether autonomous signalling is a characteristic feature of
murine B1a B cells that are typically CD5⁺ and IgM⁺. Interestingly,
BCRs from B1a B cells did not induce autonomous signalling, indicating
that this capacity is a unique feature of leukaemia-derived BCRs
(Supplementary Fig. 3c). Together, our data show that, in contrast to
all other B-cell lymphomas tested, BCRs derived from CLL patients
and TCL1 mice possess the unique capacity to induce autonomous
signalling.

The bias in VH gene usage and the high proportion of stereotyped 
HCDR3s in CLL suggest that HCDR3 has a critical role in the develop- 
ment of this disease**”. To test whether transferring HCDR3 from a 
CLL-derived BCR transfers autonomous signalling capacity to a non- 
autonomously active BCR, we replaced the HCDR3 of BCR53 with 
three different CLL-derived HCDR3s. As a negative control, we 
replaced the HCDR3 of BCR53 with the HCDR3s of a multiple 
myeloma BCR and a mantle cell lymphoma BCR that contained the 
same VH gene as the chosen CLL BCRs. Although all modified BCR53 
derivatives were expressed on the cell surface, only those containing a 
CLL-derived HCDR3 acquired the capacity for autonomous signalling 
(Fig. 1 and Supplementary Fig. 1; data not shown). 

Our previous results indicated that the ability to induce autonomous 
Ca²⁺ flux was associated with BCR polyreactivity⁸. Because most
secreted CLL-derived BCRs were reported to be polyreactive’’, we 
tested the chosen CLL BCRs for polyreactivity. Only three out of six 
randomly chosen CLL-derived BCRs—two unmutated and one 
mutated—showed polyreactive potential by binding to more than 
two antigens in enzyme-linked immunosorbent assays (ELISA) and 
showing cytoplasmic staining of HEp-2 slides in indirect immuno- 
fluorescence assays (Supplementary Fig. 4a, b). Furthermore, only 
one out of four tested TCL1 BCRs (TCL1 no. 3) showed polyreactivity 
in the same assays (Supplementary Fig. 4c, d). The frequent lack of 
polyreactivity suggests that other mechanisms induce the autonomous 
signalling of CLL-derived BCRs. To show that specific antigens on 
adjacent cells are not required for the activation of CLL BCRs, we 
examined autonomous Ca²⁺ signalling in isolated, single cells. We
detected autonomous Ca²⁺ flux in single cells expressing a CLL-derived
BCR, whereas additional stimulation with anti-BCR antibodies
was required for activation of single cells expressing the BCR53 control 
(Fig. 2a, b). 

Figure 2 | CLL BCRs induce cell-autonomous signalling. a, Live-cell
imaging of single TKO cells expressing ERT2-SLP65 and unmutated CLL2 or
BCR53 after induction with 4-OHT. Images show merged channels of Indo-1
unbound (blue) and Indo-1 bound (violet). Magnification is ×40; scale bars,
25 μm. b, Ca²⁺ flux of representative single cells from a. Kinetics are
representative of at least ten single cells.

The lack of polyreactivity suggests that antigen-independent
signalling might be achieved by HCDR3-dependent binding of BCR-intrinsic
motifs, suggesting mutual BCR recognition on CLL cells.
Indeed, secreted CLL-derived BCRs bound to the surface of TKO cells
expressing a CLL BCR or the control BCR53, although they failed to
bind to TKO cells that did not express a receptor (Fig. 3a, b and
Supplementary Fig. 5a). This suggests that the CLL-derived BCRs
might interact with intrinsic BCR structures and that this interaction
could mediate antigen-independent stimulation if it takes place within
the membrane of the same cell. Using a phage-display approach, a
recent study identified different peptide motifs that bind to primary 
CLL cells in a HCDR3-dependent manner, and one of these motifs 
showed specific binding to each of the tested CLL samples¹⁶. Through
sequence analysis we found that this motif possesses homology to a 
conserved epitope in the framework region 2 (FR2) of VH domains 
(Supplementary Fig. 5c). To test whether this region contains an 
internal epitope important for autonomous signalling, we generated 
point mutations of the amino acids Val 37 and Arg 38 within this
motif. As a control we mutated a conserved arginine (Arg 43) outside
of the motif. Our results show that mutating Val 37 or Arg 38, but not
Arg 43, abolishes autonomous signalling of CLL BCRs, whereas stimulation
with anti-BCR antibodies shows normal Ca²⁺ flux for all mutants
(Fig. 3c, d and Supplementary Fig. 5b, d; data not shown). As the 
introduced mutations are located outside of the antigen-recognition 
site of the BCR, these data indicate that the observed autonomous 
signalling is not induced by BCR interaction with external antigens 
in the environment, because such antigens should still be able to bind 
to and stimulate the mutated CLL BCR. To confirm this view, we 
mutated Val 37 and Lys 38 in the NIP-specific B1-8 BCR and found 
that the resulting mutant BCRs show normal antigen-dependent 
stimulation after incubation with NIP-bovine serum albumin 
(Fig. 3c and Supplementary Fig. 5b). Thus, these data indicate that 
the loss of autonomous signalling was caused by the inability of the 
CLL BCR to bind to the mutated intrinsic motif. In agreement with 
this, the binding of secreted CLL BCRs to TKO cells was reduced when 
the cells expressed a BCR with a mutated intrinsic motif (Fig. 3e). 
Together, our results suggest that an intrinsic BCR motif and 
HCDR3 mediate autonomous BCR stimulation in CLL-derived BCRs. 

Figure 3 | Autonomous signalling of CLL-derived BCRs is mediated by the
recognition of a BCR-intrinsic epitope. a, Surface-binding profiles of different
secreted CLL BCRs on TKO cells expressing empty vector (EV). Secreted BCR53
and anti-CD19 antibody were used as negative and positive controls,
respectively. The amount of secreted BCR used was >100-fold higher than for
the anti-CD19 antibody. b, Surface-binding profiles of secreted M-CLL1,
M-CLL2 and BCR53 on TKO cells expressing the indicated BCRs. c, d, FACS
analysis for autonomous Ca²⁺ flux after addition of the indicated stimuli in TKO
cells expressing the ERT2-SLP65 fusion protein. c, The UM-CLL2 mutants
V37G and R38A (left) or the corresponding B1-8 mutants (right). d, The UM-CLL2
R43A mutant. Arrows indicate addition of the stimuli. e, Surface-binding
profile of UM-CLL2 on TKO cells expressing B1-8 or B1-8 V37G.


To show that autonomous signalling can also be detected in primary
CLL samples, we monitored Ca²⁺ flux in primary B cells from CLL
patients and compared them to primary peripheral B cells from
healthy donors or patients with other B-cell neoplasias. Because
SLP65 function is constitutively active in primary cells, a synchronous
start of autonomous signalling as in the TKO cells cannot be achieved
in primary cells. However, constitutive autonomous signalling
suggests that a proportion of the primary cells show increased Ca²⁺
flux. In fact, our experiments show that primary CLL samples show
elevated Ca²⁺ mobilization compared with the control cells.
Moreover, this signalling capacity required BCR function, as inhibition
of the protein tyrosine kinase SYK, which transmits BCR signals¹⁷,
prevented this activity (Fig. 4 and Supplementary Fig. 6a). Similarly,
monitoring Ca²⁺ mobilization in single cells showed periodic SYK-dependent
signals in isolated primary CLL B cells but not in control
cells (Fig. 4 and Supplementary Fig. 6b). As survival of CLL B cells is
associated with nuclear factor-κB (NF-κB) activation¹⁸, we investigated
the expression of NF-κB-regulated survival genes¹⁹ in primary
CLL cells and found that inhibition of autonomous BCR signalling by
inhibition of SYK led to a reduction in the expression of MYC and
XIAP (Supplementary Fig. 6c).

Figure 4 | Cell-autonomous Ca²⁺ signalling is elevated in primary CLL B
cells. Ca²⁺ flux in primary peripheral B cells from a healthy donor or a CLL
patient with or without R406 treatment (left panels; addition of an anti-BCR
stimulus as a control for cell viability and Indo loading is indicated by a black
arrow) and Ca²⁺ flux in single cells from a healthy donor or a CLL patient (right
panels). DMSO, dimethylsulphoxide.


BCR expression is commonly used as a diagnostic marker for the 
characterization of CLL patients. The restricted BCR repertoire of both 
mutated and unmutated CLL suggests a potentially important role for 
BCR signalling in the transformation and expansion of CLL B cells. 
The restricted BCR repertoire and the observed polyautoreactivity 
suggested that CLL BCRs might recognize extrinsic autoantigens, 
thereby inducing the selective expansion of the respective B cells. 
Therefore, blocking such autoantigens or blocking BCR function by 
inhibiting downstream signalling factors, such as the tyrosine kinase 
SYK, are considered as potential therapies for CLL’. 

However, our data indicate that CLL BCRs induce cell-autonomous 
signalling independent of extrinsic antigens and identify an intrinsic 
motif that might provide new targets for the therapy of CLL by inter- 
fering with autonomous BCR stimulation. It is conceivable that addi- 
tional intrinsic epitopes exist that activate the autonomous signalling of 
distinct CLL BCRs. In this scenario, the HCDR3-driven interaction with 
distinct epitopes on nearby BCRs is sufficient for the induction of auto- 
nomous signalling on CLL cells. In agreement with this, HCDR3 
sequences derived from CLL BCRs were able to transfer the autonomous 
signalling capacity to other receptors in our experiments. Presumably, 
only certain HCDR3 sequences have this unique capacity, which might 
explain the restricted repertoire and the requirement for certain HCDR3 
regions in CLL. Thus, the presence of distinct intrinsic epitopes may 
explain the observed structural similarities of CLL BCRs, which are 
probably selected on the basis of the ability of HCDR3 to interact with 
such intrinsic structures. It should be noted, however, that the presence 
of intrinsic epitopes and their role in autonomous signalling does not 
rule out that for certain cases, extrinsic antigens are involved in the 
pathogenesis of CLL. This may further potentiate the basal activity of 
CLL BCRs. Our study presents a new mechanism for the pathogenesis of 
CLL and suggests that investigating intrinsic BCR-associated epitopes 
may lead to the development of new therapeutic approaches based on 
the inhibition of the autonomous BCR activation in CLL cells. 


METHODS SUMMARY 


Details of lymphoma material, clinical data, mice strain data, cells and cell-culture 
conditions, polyreactivity assays, plasmids and retroviral transduction, flow cytometry
and Ca²⁺ flux analysis, single-cell Ca²⁺ flux analysis, NF-κB-activity assays,
reverse-transcriptase PCR and statistical analysis are provided in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS


Lymphoma material and clinical data. CLL B-cell samples were obtained from 
the peripheral blood of patients at the University Medical Center Freiburg. Data 
for IGVH status, ZAP70 expression and time to first treatment were taken from 
the clinical records. The local ethics committee approved patient sampling and all 
patients gave informed consent. 

Mice. Eμ-TCL1 mice (provided by C. Croce) were described previously’*. Mice 
were bred in the animal facility of the Max Planck Institute of Immunobiology and 
Epigenetics under specific pathogen-free conditions. Animal experiments were 
done in compliance with the guidelines of the German law. 

Cells and cell-culture conditions. HEK293T, Phoenix and TKO cells were 
cultured in Iscove’s Medium (Biochrom AG) containing 5% FCS (PAN- 
Biotech), 10 mM L-glutamine (Gibco) and 100 U ml⁻¹ penicillin/streptomycin 
(Gibco). For TKO cultures, the medium was supplemented with 50 mM 
2-mercaptoethanol (Gibco) and supernatant of J558L mouse plasmacytoma cells 
stably transfected with a murine IL-7 expression vector. 

Primary peripheral B cells from CLL patients and healthy donors were cultured 
in RPMI containing 10% FCS, 10 mM L-glutamine and 100 U ml⁻¹ penicillin/ 
streptomycin for 24 h. For the inhibition of SYK, cells were then treated with 
4 μM R406 (Rigel) for 1 h. 

ELISA and indirect immunofluorescence assays. ELISA assays were performed 
using microtitre plates coated with 10 μg ml⁻¹ of single-stranded DNA, double- 
stranded DNA (ORGENTEC), lipopolysaccharide or human insulin (Sigma- 
Aldrich). CLL antibodies were produced as described previously’ and purified 
antibodies were used at 1 mg ml⁻¹ and three serial dilutions of 1:4 in PBS. All 
ELISA assays were developed with horseradish-peroxidase-labelled goat anti- 
human IgG (BioSource). OD495 nm was measured using a microtitre plate reader 
(SpectraMax 190). 
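
The antibody concentrations used in the ELISA (a 1 mg ml⁻¹ stock followed by three serial 1:4 dilutions) can be written out explicitly. The sketch below is illustrative only: the function name is hypothetical, and it assumes that "1:4" means each step contains one quarter of the previous concentration (one part sample in four parts total). If the protocol intended one part sample plus four parts diluent, the factor would be 5 instead.

```python
def serial_dilutions(start_mg_per_ml, factor, steps):
    """Return the starting concentration followed by `steps` successive dilutions."""
    concentrations = [start_mg_per_ml]
    for _ in range(steps):
        # Each step divides the previous concentration by the dilution factor.
        concentrations.append(concentrations[-1] / factor)
    return concentrations


# Assumption: 1:4 is read as a fourfold dilution per step.
series = serial_dilutions(1.0, 4, 3)
for conc in series:
    print(f"{conc * 1000:.3f} ug/ml")
```

Under this reading the plate carries four antibody concentrations per sample, spanning a 64-fold range.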

Indirect immunofluorescence assays were done as previously described”. In 
brief, diagnostic HEp-2 slides were incubated with 15 μl purified CLL antibodies 
(1 mg ml⁻¹). After washing, a Cy3-conjugated anti-human IgG1 antibody 
(Jackson ImmunoResearch Laboratories) was used to detect binding. Nuclear 
staining was performed with 4’,6-diamidino-2-phenylindole (DAPI). 

Plasmids and retroviral transduction. ERT2-SLP65 (ref. 12) was expressed 
using a vector containing tdTomato as expression control (a gift from R. Y. 
Tsien). Immunoglobulin heavy chain and light chain were expressed using the 
bimolecular fluorescence complementation vector system described previously*. 

CLL-derived BCRs were amplified from patient samples by anchor-PCR using 
poly-G-tailed complementary DNA and a poly-C-containing primer as 
described*”*. Human VH domains were fused to the murine constant region 
(μCH) by PCR, whereas the complete human light chains were used. Other 
lymphoma-derived BCRs were amplified from sequencing vectors containing 
heavy-chain or light-chain sequences of multiple myeloma, mantle cell 
lymphoma, marginal zone lymphoma and follicular lymphoma (CellGenix). 
For virus production, Phoenix cells were transfected using FuGeneHD as 
recommended by the manufacturer’s protocol. Supernatants were collected 48 h 
after transfection and used for transduction of TKO cells as described’’. 

Flow cytometry and Ca²⁺ flux analysis. For FACS analyses, TKO cells were 
stained with goat anti-mouse IgM (Southern Biotech), goat anti-human κ light 
chain and goat anti-human λ light chain (Southern Biotech). 

Ca²⁺ analyses were performed as described’*. In brief, 1 × 10° freshly trans- 
duced TKO cells expressing ERT2-SLP65 were loaded with Indo-1 (Invitrogen) 
using Pluronic (Invitrogen). Induction of ERT2-SLP65 was performed by the 
addition of 2 μM 4-OHT (Sigma-Aldrich). Goat anti-mouse IgM (10 μg ml⁻¹; 
Southern Biotech) was used for crosslinking of the BCR. For the inhibition of 
SYK, cells were pre-treated with 4 μM R406 (Rigel) for 1 h. 

Single-cell Ca²⁺ flux analysis. TKO cells expressing ERT2-SLP65 and BCRS3 or 
unmutated CLL2 were prepared as described”* and seeded onto coated eight-well 
μ-Slides (ibidi) using BD Cell-Tak (BD Biosciences). Live-cell imaging of single 
cells was performed using a scan^R screening station on an IX81 inverted micro- 
scope (Olympus). Measurements were taken at 37 °C and at time intervals of 6 min 
after induction with 4-OHT or 4-OHT and goat anti-mouse IgM (Southern 
Biotech) or goat anti-human IgM. For the inhibition of SYK, cells were pre-treated 
with 4 μM R406 (Rigel) for 1 h. 

NF-κB activity assay. NF-κB activity was measured using the TransAM NF-κB 
p50 transcription factor assay kit (Active Motif). In brief, peripheral blood B cells 
from CLL patients were treated with 4 μM R406 for 1 h or were left untreated. 
Total cellular lysates were applied to 96-well plates coated with oligonucleotides 
containing the NF-κB consensus sequence (5’-GGGACTTTCC-3’) and NF-κB 
activity was calculated by measuring p50 binding using a specific antibody. 

Quantitative reverse-transcriptase PCR. Total RNA was extracted, reverse- 
transcribed using poly-dT primers and amplified using MYC-specific primers 
and XIAP-specific primers designed by Universal ProbeLibrary Assay Design 
Center (Roche Applied Science). In addition, the amount of actin transcripts 
was quantified in all samples as an internal control. SYBR green was used in the 
detection method. 
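
Normalizing a target transcript to an internal control such as actin is commonly done with the comparative Ct (ΔΔCt) calculation. The text does not state which calculation was used, so the sketch below is an assumption for illustration only: it presumes roughly 100% amplification efficiency (a doubling per cycle), and the function name and Ct values are hypothetical.

```python
def relative_expression(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target gene versus a control sample (comparative Ct).

    Each Ct is first normalized to the reference gene measured in the same
    sample; the normalized values are then compared between samples.
    """
    delta_sample = ct_target - ct_reference
    delta_control = ct_target_ctrl - ct_reference_ctrl
    # Assumption: perfect doubling per cycle, hence base 2.
    return 2.0 ** -(delta_sample - delta_control)


# Hypothetical Ct values: target at 24 and actin at 18 in treated cells,
# target at 22 and actin at 18 in untreated cells.
fold_change = relative_expression(24.0, 18.0, 22.0, 18.0)
print(fold_change)
```

A value below 1 indicates lower normalized expression in the treated sample than in the control; in practice, primer efficiencies would need to be checked before relying on the base-2 assumption.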

Statistical analysis. For statistical analysis, GraphPad Prism software was used. 

Organisms that protect their germ-cell lineages from damage often do so at considerable cost: limited metabolic 
resources become partitioned away from maintenance of the soma, leaving the ageing somatic tissues to navigate 
survival amid an environment containing damaged and poorly functioning proteins. Historically, experimental 
paradigms that limit reproductive investment result in lifespan extension. We proposed that germline-deficient 
animals might exhibit heightened protection from proteotoxic stressors in somatic tissues. We find that the forced 
re-investment of resources from the germ line to the soma in Caenorhabditis elegans results in elevated somatic 
proteasome activity, clearance of damaged proteins and increased longevity. This activity is associated with increased 
expression of rpn-6, a subunit of the 19S proteasome, by the FOXO transcription factor DAF-16. Ectopic expression of 
rpn-6 is sufficient to confer proteotoxic stress resistance and extend lifespan, indicating that rpn-6 is a candidate to 
correct deficiencies in age-related protein homeostasis disorders. 


In nature, food sources are largely unpredictable and insufficient. 
The constant pressures that limited energetic resources place on an 
organism have long been theorized to cause a significant life-history 
trade-off: the absolute need for repairing and preventing damage to 
the germ line, and for ensuring elimination of damage in progeny, 
necessarily dominates resource allocation strategies, whereas conversely 
little or no evolutionary pressure will be placed on the maintenance of 
the soma’. Thus, ageing, post-reproductive organisms that escape pre- 
dation witness the gradual deterioration of their own somatic tissues. In 
support of such theories, modulations of reproduction that eliminate 
germ cells provide effective mechanisms for extending lifespan””, 
phenotypes that may be caused by heightened resource availability 
within the post-mitotic soma. Likewise, it has been proposed that 
animals undergoing dietary restriction adopt a strategy in which 
resources are re-allocated towards somatic maintenance, extending 
lifespan and prolonging reproduction until conditions for survival 
become more favourable’. 

When proliferating germline cells of C. elegans are removed, worms 
live up to 60% longer than normal and seem to be resistant to a variety 
of environmental stressors”’. Whereas germline ablation affords an 
obvious protection, the downstream effectors of such protection 
remain ambiguous. The re-allocation of resources to the soma seems 
to be directed through a specific, genetically defined stress-responsive 
pathway. Germline removal extends lifespan by triggering an active 
signalling network, involving the nuclear localization and activation 
of DAF-16, a forkhead transcription factor (FOXO)® and the major 
downstream effector of the DAF-2-insulin-insulin-like growth factor 
(IGF) signalling (IIS) pathway. However, although worms with an 
ablated germ line exhibit a daf-16-dependent extension in lifespan, 
longevity caused by germline ablation functions in a synergistic 
manner with mutations in the IIS receptor, daf-2 (ref. 6). 
Additionally, in germline-ablated animals, but not daf-2 mutant worms, 
kri-1, daf-9 and the nuclear hormone receptor daf-12 are also required 
for the constitutive nuclear localization of DAF-16 (refs 9, 10). 


Notably, post-mitotic somatic cells are particularly susceptible to age- 
onset protein aggregation diseases. As the somatic cell ages, the accu- 
mulation of damaged proteins represents a particular challenge to the 
ageing cell, especially as they aggregate in inclusions and aggresomes 
capable of overwhelming the cellular machinery required for their 
degradation’*"*. These effects are probably compounded by age-related 
dysregulation of chaperones, a downregulation of degradation 
machinery itself, and a continually accelerating loss in general cellular 
homeostasis. As such, a rapid decline in the capacity of the cell to protect 
its proteome has been highly correlated with multiple age-related 
disorders’’. This conversely indicates that long-lived somatic cells, such 
as those found in a germline-ablated animal, might exhibit a heightened 
capacity for clearing damaged proteins, and that this proteostatic 
capacity might contribute to the increased longevity in these mutants. 


Increased proteasome activity in glp-1(e2141) worms 


We hypothesized that a key aspect of the proteostasis network, the 
ubiquitin proteasome system (UPS), might be altered in the soma of 
germline-ablated animals. To test this hypothesis, we first examined 
the activity of the 26S/30S proteasome in several long-lived mutants 
using a fluorogenic peptide substrate specific for the chymotrypsin- 
like activity of the proteasome (Supplementary Fig. 1). We found that 
glp-1(e2141) mutant worms, which lack their germ line, displayed a 
marked increase (over sixfold) in the chymotrypsin-like proteasome 
activity (Fig. 1a and Supplementary Figs 1 and 2a-c). The proteasome 
inhibitors MG-132, lactacystin and PI-I blocked activity from extracts 
in both the glp-1 mutant and the control strain (Supplementary Fig. 3), 
indicating that the increased peptidase activity in glp-1(e2141) worms 
was due to the proteasome. The caspase-like and trypsin-like activities 
were also increased in glp-1 mutant worms (Fig. 1b, c). Additionally, 
we found that a C. elegans genetic model of dietary restriction’ 
also induced proteasome activity, although to a lesser extent than the 
glp-1(e2141) mutation (Supplementary Fig. 1). In contrast, neither 
reduced IIS signalling by mutation of daf-2 nor reduced mitochondrial 

Figure 1 | Germline-lacking nematodes have increased proteasome activity. 
a, Chymotrypsin-like activity of the proteasome monitored by Z-Gly-Gly-Leu- 
AMC digestion in day 7 adult worm extract containing equal amounts of total 
protein. Proteasome activity (relative slope to control strain fer-15(b26);fem- 
1(hc17)) is shown, with error bars indicating mean ± s.e.m. (n = 50, 
*P = 1.83 X 10 |”). b, Caspase-like (Z-Leu-Leu-Glu-AMC) proteasome 
activity (relative slope) is shown, with error bars representing mean ± s.e.m. 
(n = 8, *P < 0.00001). c, Trypsin-like (Ac-Arg-Leu-Arg-AMC) proteasome 
activity (relative slope) is shown, with error bars showing mean ± s.e.m. (n = 7, 
*P < 0.00001). d, Representative polyubiquitinated protein immunoblot. 
α-Tubulin is used as the loading control. 


electron transport chain activity'’® upregulated proteasome activity 
(Supplementary Fig. 1). Consistent with increased proteasome activity 
in glp-1(e2141) worms, we observed decreased levels of polyubiquiti- 
nated proteins in these worms (Fig. 1d). To examine further UPS 
activity in living animals, we used a photoconvertible fluorescent 
UPS reporter system for live imaging and quantification of protein 
degradation in C. elegans'®. This proteasome reporter consists of the 
photoconvertible fluorescent protein Dendra2 targeted for proteasomal 
degradation by fusion to a mutant form of ubiquitin (UbG76V) that 
cannot be cleaved by ubiquitin hydrolases. Dendra2 can be irreversibly 
photoconverted from a green to a red fluorescent state, providing quan- 
tification of UPS activity independently of protein synthesis. We found 
that this reporter is degraded more rapidly in glp-1 mutant worms 
compared to control strains, whereas Dendra2 lacking the UbG76V 
signal remained stable (Supplementary Fig. 4). 

glp-1(e2141) worms differ from other types of reproductive mutants 
in that their entire germ line is missing. Notably, glp-1 mutants exhibit 
a significantly increased lifespan in comparison to worms that are also 
sterile but which still contain a proliferating germ line (Supplementary 
Fig. 5). We observed that the increased proteasome activity of glp-1 
mutants was not due to a benefit of sterility per se, because the normal- 
lived, sterile control fer-15(b26);fem-1(hc17) animals have similar pro- 
teasome activity as wild-type animals (Supplementary Figs 2a and 6). 
Furthermore, treatment with 5-fluoro-2'-deoxyuridine (FUdR), a 
drug used to block progeny production in worms”, did not affect 
proteasome activity (Supplementary Fig. 6). The glp-1(e2141) allele is 
temperature-sensitive for reproduction and longevity'*; these worms 
are only long-lived when they are shifted to restrictive temperature 
(25 °C) either during development or in early adulthood’. Accordingly, 
proteasome activity is not increased in glp-1(e2141) worms grown 
continuously at the permissive temperature (Supplementary Fig. 7). 
However, resembling the longevity phenotype’, glp-1(e2141) worms 
maintained high proteasome activity when down-shifted to a permis- 
sive temperature after germline removal (Supplementary Fig. 8). These 
results indicate that different forms of sterility do not have similar 
effects on proteasome activity; the increase is specific to loss of the germ line. 
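
Proteasome activity in these assays is reported as a slope relative to the control strain. A minimal sketch of that kind of calculation follows, assuming fluorescence from the AMC substrate is read at regular intervals and the slope is taken by ordinary least squares. The function names and the fluorescence readings are invented for illustration and are not data from this study.

```python
def slope(times, values):
    """Ordinary least-squares slope of values against times."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    num = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    den = sum((t - mean_t) ** 2 for t in times)
    return num / den


def relative_slope(times, sample, control):
    """Slope of the sample trace divided by the slope of the control trace."""
    return slope(times, sample) / slope(times, control)


# Hypothetical readings (arbitrary fluorescence units) at 0, 10, 20 and 30 min.
minutes = [0, 10, 20, 30]
control_trace = [100, 150, 200, 250]
mutant_trace = [100, 400, 700, 1000]

ratio = relative_slope(minutes, mutant_trace, control_trace)
print(f"relative slope: {ratio:.1f}")
```

Expressing activity as a ratio of slopes removes the dependence on the absolute fluorescence scale, which is why extracts must contain equal amounts of total protein for the comparison to be meaningful.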


DAF-16 regulates proteasome activity 


Because DAF-16, the worm FOXO transcription factor, is essential for 
the increased longevity of glp-1 mutant worms’, we tested whether 
daf-16 was also required for the increased proteasome activity found 
in glp-1(e2141) animals. Proteasome activity of glp-1 mutant animals 
was suppressed to wild-type levels in daf-16;glp-1 double mutant 
animals (Fig. 2a-c). daf-16 is required during reproductive adulthood 
to modulate the ageing process in worms’”. Accordingly, daf-16 RNA 
interference (RNAi) treatment of glp-1(e2141) animals during 
adulthood decreased proteasome activity (Fig. 2d). In contrast, daf- 
16 RNAi did not affect proteasome activity in control strains (Fig. 2d 
and Supplementary Fig. 9), where DAF-16 is located in the cytosol 
and is inactive. Loss of daf-16 suppressed longevity and proteasome 
activity, but not the reproductive phenotype of glp-1(e2141) worms, 
providing further evidence that increased proteasomal activity could 
not be separated from the increased longevity mediated by DAF-16 in 
glp-1 mutants. We examined whether other genes required to pro- 
mote DAF-16/FOXO nuclear localization in the germline longevity 
pathway (that is, daf-12, daf-9 and kri-1) were also necessary for 
increased proteasome activity. Accordingly, reduction of any one of 
these genes in glp-1(e2141) worms resulted in decreased proteasome 
activity, although not to the extent of daf-16 reduction (Fig. 2e). 
Furthermore, reduction of daf-12, daf-9 or kri-1 did not further 
decrease proteasome activity of the daf-16;glp-1 double mutant 
animals or the control strain (Supplementary Fig. 10a, b). In addition 
to daf-16, glp-1-mediated longevity requires two additional transcrip- 
tion factors: hsf-1 (ref. 20) and skn-1 (Supplementary Fig. 11). hsf-1 
is required for the regulation of adult lifespan, heat-shock and 
proteotoxic stress*'”’; skn-1 is the worm orthologue of nrf-2 and has 
a central role in oxidative stress responses in worms, flies and 
mice~***. In contrast to daf-16, daf-12, daf-9 and kri-1, we observed 
that neither hsf-1 nor skn-1 was required for the increased proteasome 
activity in glp-1(e2141) worms (Fig. 2f, g). To uncover a possible 
redundancy of either hsf-1 or skn-1 with daf-16, we knocked down 
these factors in the daf-16;glp-1 double mutant animals. However, in 
the context of daf-16 loss, neither hsf-1 nor skn-1 further affected 
proteasome activity in glp-1(e2141) worms (Supplementary Fig. 12). 
The nuclear hormone receptor nhr-80 links fatty acid desaturation to 
lifespan extension through germline ablation in a daf-16-independent 
manner~. Consistent with a requirement for daf-16 in proteasome 
activity, nhr-80 was not required for increased proteasome activity in 
glp-1 mutant worms (Supplementary Fig. 13). Taken together, altera- 
tions that specifically affect DAF-16 activity, but not HSF-1, SKN-1 or 
NHR-80, alter proteasome activity in glp-1 mutants, indicating that a 
major output for DAF-16-mediated longevity in this mutant is to 
increase proteasome activity. 


DAF-16 regulates rpn-6.1 levels 

The 26S/30S proteasome consists of a 20S core structure that contains 
the proteolytic active sites and 19S cap structures that impart regu- 
lation on the activity of the holo-complex (26S, single capped, and 
30S, double capped)**. Although 20S particles can exist in a free form, 
20S particles in their most physiological form are inactive, unable to 
degrade denatured proteins or cleave peptides”. The 19S regulatory 
subunit is responsible for stimulating the 20S proteasome to degrade 
proteins, as ATPases of the regulatory particle open the 20S core, 
allowing substrates access to proteolytic active sites. Analysis of 

Figure 2 | DAF-16 is required for proteasome activity in glp-1(e2141) 
mutant nematodes. a, Chymotrypsin-like proteasome activity (relative slope 
to fer-15(b26);fem-1(hc17)) is shown. Error bars indicate mean ± s.e.m. 
(n = 11, *P < 0.00005). b, Caspase-like proteasome activity (relative slope to 
fer-15(b26);fem-1(hc17)) is shown. Error bars indicate mean ± s.e.m. (n = 6, 
*P < 0.0001). c, Trypsin-like proteasome activity (relative slope to fer- 
15(b26);fem-1(hc17)) is shown. Error bars represent mean ± s.e.m. (n = 6, 
*P < 0.005). d, glp-1(e2141) worms fed daf-16 RNAi bacteria show decreased 
chymotrypsin-like proteasome activity (P = 8.5 X 10° * glp-1 mutant fed vector 
RNAi bacteria versus glp-1 mutant fed daf-16 RNAi bacteria; n = 16). daf-16 
RNAi knock down does not affect proteasome activity in fer-15;fem-1 worms 
(P = 0.79 fer-15;fem-1 fed vector RNAi bacteria versus fer-15;fem-1 fed daf-16 
RNAi bacteria; n = 9). e, Chymotrypsin-like proteasome activity in glp- 
1(e2141) worms fed daf-16, daf-12, daf-9 or kri-1 RNAi bacteria (n = 4; vector 


the messenger RNA levels of the 20S proteasome subunits revealed 
that α-subunits were not increased in glp-1 mutants whereas only one 
of the β-subunits, pbs-5, was moderately increased (Supplementary 
Fig. 14 and Supplementary Table 1). PBS-5 is the β-type subunit that 
contains the chymotrypsin-like proteolytic active site*®. With regard 
to the 19S proteasome subunits, we did not detect an increase of the 
ATPase subunits (Fig. 3a and Supplementary Table 1). Notably, only 
one of the non-ATPase subunits showed increased mRNA levels in glp-1 
mutant animals: rpn-6.1, an essential subunit for the activity of the 
26S/30S proteasome that stabilizes the otherwise weak interaction 
between the 20S core and the 19S cap”*° (Fig. 3a, b and Sup- 
plementary Table 1). mRNA levels of the closely related rpn-6.2 were 
not increased in glp-1 mutants (Fig. 3a and Supplementary Table 1). 
rpn-6.1 had a threefold increase in its expression in glp-1 mutants and 
was by far the most increased of all subunits. Accordingly, knock down 
of rpn-6.1 markedly decreased proteasome activity in glp-1(e2141) 
animals (Fig. 3c) similar to loss of daf-16. In contrast, loss of other 
non-ATPase subunits did not affect proteasome activity of glp-1 
mutants (Supplementary Fig. 15). Knock down of rpn-6.1 induced an 
upregulation in the expression of the rest of the 26S proteasome sub- 
units, probably to compensate for the reduction in proteasome activity 
induced by decreased levels of this critical subunit (Supplementary 
Table 2). Moreover, overexpression of rpn-6.1 in wild-type animals 



RNAi versus daf-16 RNAi (P = 4.17 X 10°); vector RNAi versus daf-12 RNAi 
(P < 0.01); vector RNAi versus daf-9 RNAi (P < 0.05); vector RNAi versus kri- 
1 RNAi (P < 0.01)). f, Chymotrypsin-like proteasome activity (relative slope to 
glp-1 mutant fed vector RNAi bacteria) is shown. Error bars indicate 
mean ± s.e.m. (n = 8; vector RNAi versus daf-16 RNAi (P < 0.0001); vector 
RNAi versus hsf-1 RNAi (P = 0.74)). g, skn-1 RNAi does not affect 
chymotrypsin-like proteasome activity of glp-1(e2141) worms (vector RNAi 
versus daf-16 RNAi (P < 0.00001); vector RNAi versus hsf-1 RNAi (P = 0.46)). 
Chymotrypsin-like proteasome activity (relative slope to glp-1 mutant fed 
vector RNAi bacteria) is shown. Error bars indicate the mean ± s.e.m. (n = 9; 
vector RNAi versus daf-16 RNAi (P < 0.00001); vector RNAi versus hsf-1 RNAi 
(P = 0.46)). All activities measured at day 5 of adulthood. All RNAi treatment 
was initiated at day 1 of adulthood. All statistical comparisons were made by 
Student’s t-test for unpaired samples. 


was sufficient to increase proteasome activity (Fig. 3d). The increase 
in rpn-6.1 mRNA levels did not alter the expression of other 26S pro- 
teasome subunits (Supplementary Table 3). Taken together, RPN-6.1 
seems to be a key component required for activation of the proteasome 
machinery of the germline-lacking nematodes. 

We found that DAF-16 is necessary for the increased expression of 
rpn-6.1 by analysing its mRNA levels in both daf-16;glp-1 double 
mutants and daf-16 RNAi-treated animals (Fig. 3e, Supplementary 
Figs 16 and 17 and Supplementary Table 4). Notably, mRNA levels 
of other proteasome subunits, including pbs-5, were not decreased by 
loss of daf-16 in glp-1 mutant worms (Supplementary Fig. 17). daf-16 
RNAi did not change rpn-6.1 expression in control worms 
(Supplementary Fig. 16). Therefore, these results display a correlation 
between daf-16 activity, rpn-6.1 levels and proteasome activity. 
Consistent with DAF-16 regulating the expression of rpn-6.1, we iden- 
tified a potential DAF-16 binding site’! within the first intron of rpn-6.1 
(Supplementary Fig. 18). This site is supported by a DAF-16 binding 
region defined by the modENCODE project”, indicating that rpn-6.1 is 
likely to be a direct DAF-16 target. To explore further rpn-6.1 transcrip- 
tional regulation, we generated a transcriptional reporter construct. We 
found that rpn-6.1 was expressed in the pharynx and posterior intestine 
in control worms. Notably, rpn-6.1 expression increased markedly 
in the pharynx and throughout the intestine of glp-1 mutants. In 

Figure 3 | DAF-16 is necessary for increased expression of rpn-6.1 in glp-1 
mutants. a, Data represent the mean ± s.e.m. of the relative expression levels 
to fer-15(b26);fem-1(hc17) (n = 11, P = 2.09 X 10°). b, Western blot analysis 
of RPN-2, RPN-6.1, RPN-8, RPT-6, α6 and α1+2+3+5+6+7. β-Actin is the 
loading control. c, glp-1 mutants fed rpn-6.1 RNAi bacteria starting day 1 of 
adulthood have decreased chymotrypsin-like proteasome activity (P < 0.05). 
Proteasome activity (relative slope to fer-15(b26);fem-1(hc17)) represents the 
mean ± s.e.m. (n = 6). d, Increased chymotrypsin-like proteasome activity in 
rpn-6.1 overexpressing N2 worms (rpn-6.1, GFP(OE)) in day 1 adult worm 
extract (P < 0.001). Proteasome activity (relative slope to GFP(OE) worms) 
represents the mean ± s.e.m. (n = 9). e, daf-16;glp-1 double mutants have 
decreased rpn-6.1 mRNA levels (P < 0.001). Graph represents the 
mean ± s.e.m. (n = 6). f, Representative images of RFP expressed under 
control of the rpn-6.1 promoter. rpn-6.1 is expressed in the pharynx and 
posterior intestine in N2 worms. Increased rpn-6.1 expression in glp-1(e2141) 
animals relative to N2. daf-16;glp-1 mutant worms have decreased rpn-6.1 
expression compared to glp-1 mutants. DIC, differential interference contrast 
microscopy. Scale bar, 100 μm. g, Quantification of RFP signal intensity 
(mean ± s.e.m. (n = 5)). glp-1(e2141) worms have increased rpn-6.1 
expression compared to N2 (P < 0.01) and daf-16(mu86);glp-1(e2141) double 
mutants (P < 0.05). All statistical comparisons were made by Student’s t-test 
for unpaired samples. 


daf-16;glp-1 double mutants, we found an almost twofold decrease in 
expression of rpn-6.1 compared to glp-1(e2141) worms, although rpn- 
6.1 expression is still increased compared to wild-type worms (Fig. 3f, 
g). These results correlated with quantitative real-time polymerase 
chain reaction (qPCR) data indicating that daf-16 mutations decrease 
rpn-6.1 expression in glp-1 mutants, but not to control levels (Fig. 3e, g), 
suggesting that a potential additional factor may be necessary for rpn- 
6.1 expression in addition to DAF-16. 
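
The figure legends in this section state that comparisons were made by Student's t-test for unpaired samples. As a rough sketch of what that statistic computes, the code below derives the pooled-variance t value and its degrees of freedom for two independent groups. The function name and the numbers are hypothetical, and the P value itself (which would come from the t distribution with the returned degrees of freedom) is deliberately left out.

```python
from math import sqrt
from statistics import mean, variance


def student_t_unpaired(a, b):
    """Pooled-variance (Student's) t statistic and degrees of freedom."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    # statistics.variance is the sample variance (n - 1 denominator).
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


# Hypothetical relative activity values for two groups of replicates.
group_a = [1.0, 2.0, 3.0]
group_b = [4.0, 5.0, 6.0]

t_value, dof = student_t_unpaired(group_a, group_b)
print(f"t = {t_value:.3f}, df = {dof}")
```

The pooled form assumes the two groups share a common variance; when that assumption is doubtful, Welch's variant is the usual alternative.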

rpn-6.1 determines stress resistance 


With the strong connection between daf-16, a key ageing modulator, 
rpn-6.1, a key proteasomal factor, and increased proteasome activity 
in long-lived glp-1 mutant animals dependent upon both daf-16 and 
rpn-6.1, we asked what role, if any, rpn-6.1 has in longevity. To 
assess the requirement for rpn-6.1 during lifespan, we conducted 
RNAi knock down of this gene in C. elegans. Because proteasomal 
function is required during larval development”, we initiated rpn-6.1 
RNAi treatment during adulthood, the time at which daf-16 is 
required for longevity assurance’. Knock down of rpn-6.1 substan- 
tially decreased the lifespan of glp-1 mutant animals (Supplementary 
Fig. 19). However, loss of rpn-6.1 also decreased lifespan of both wild- 
type and sterile control animals (fer-15(b26);fem-1(hc17)). In addition 
to germ-cell loss, we also examined the effects of rpn-6.1 RNAi on 
other pathways that influence lifespan such as reduced IIS (daf- 
2(e1370) worms), reduced mitochondrial electron transport chain 
(isp-1(qm150) worms) and reduced food intake (eat-2(ad1116) 
worms). In all cases, knock down of rpn-6.1 substantially decreased 
lifespan, confirming that this gene is essential for the viability of adult 
animals, making lifespan analysis by rpn-6.1 loss of function difficult 
to interpret (Supplementary Fig. 19). 

To explore whether rpn-6.1 might have a positive role in longevity, 
we tested the impact of increased rpn-6.1 expression. We over- 
expressed rpn-6.1 in wild-type worms and conducted a series of 
physiological assays to measure the effects of rpn-6.1 overexpression 
(OE) on resistance to challenges of oxidative stress, heat-shock and 
ultraviolet (UV) damage, all of which are correlated with increased 
longevity. rpn-6.1(OE) nematodes were significantly more resistant 
than control strains to oxidative stress induced by growing the worms 
in the presence of paraquat (Fig. 4a and Supplementary Fig. 20a). 
Under heat stress (34 °C), rpn-6.1(OE) worms lived markedly longer 
than control strains (Fig. 4b and Supplementary Fig. 20b). 
Interestingly, increased levels of rpn-6.1 did not result in global upre- 
gulation of all stress responses because rpn-6.1(OE) did not protect 
against UV damage (Fig. 4c and Supplementary Fig. 20c). Because 
overexpression of rpn-6.1 increased resistance to conditions that 
challenge the proteome, we examined whether it also resulted in 
lifespan extension. rpn-6.1(OE) did not extend the lifespan of worms 
at 20°C but it did at 25 °C, a temperature that results in mild heat 
stress (Fig. 4d and Supplementary Fig. 20d). daf-16 RNAi treatment 
blocked the lifespan extension induced by rpn-6.1(OE) at 25°C 
(Fig. 4e and Supplementary Fig. 20e). Therefore, under conditions 
of proteome stress, overexpression of rpn-6.1 is sufficient to promote 
increased survival. As a more formal test, we asked whether animals 
with a reduced heat-shock response via hsf-1 downregulation had 
increased survival when rpn-6.1 was overexpressed. hsf-1-RNAi- 
treated rpn-6.1(OE) worms were long-lived compared to control 
strains under the same treatment (Fig. 4f and Supplementary Fig. 20f). 
This last result not only indicates that the lifespan extension induced by 
rpn-6.1(OE) is hsf-1 independent, but also suggests that these worms can 
significantly overcome the loss of this critical transcription factor 
required for adult lifespan, heat-shock and proteotoxicity responses. 
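
Survival under stress and lifespan are compared here by log-rank tests on survival curves. As background, the sketch below builds a Kaplan-Meier survival estimate, the curve on which such tests operate, from per-animal times and a flag marking whether death was observed or the animal was censored. It does not implement the log-rank test itself, and the function name and data are hypothetical.

```python
def kaplan_meier(times, observed):
    """Kaplan-Meier survival estimate.

    times:    time of death or censoring for each animal
    observed: True if death was seen, False if the animal was censored
    Returns a list of (time, survival probability) at each observed death time.
    """
    at_risk = len(times)
    survival = 1.0
    curve = []
    for t in sorted(set(times)):
        deaths = sum(1 for ti, ob in zip(times, observed) if ti == t and ob)
        leaving = sum(1 for ti in times if ti == t)
        if deaths:
            # Animals censored at time t still count as at risk at t.
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= leaving
    return curve


# Hypothetical data: five animals, one censored on day 3.
days = [2, 3, 3, 5, 6]
died = [True, True, False, True, True]

for day, s in kaplan_meier(days, died):
    print(f"day {day}: S = {s:.2f}")
```

Censored animals contribute to the number at risk until they leave the experiment, which is why lifespan sample sizes are often given as a fraction of animals scored out of the total started.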

Intrigued by the protection that rpn-6.1 overexpression could 
confer, we hypothesized that RPN-6.1 could be a potential candidate 
to correct protein homeostasis deficiencies underlying diseases such 
as Alzheimer’s, Parkinson’s or Huntington’s disease. Because the 
latter disease has been associated with proteasome failure™, we tested 
whether increased levels of rpn-6.1 could have beneficial effects in a 
polyglutamine (polyQ) disease model. Worm motility is markedly 
reduced by the aggregation of polyQ expression in neurons, with a 
pathogenic threshold at a length of 35-40 glutamines*. Notably, 
rpn-6.1 overexpression substantially improved motility and reduced 
toxicity of worms expressing polyQ40 and polyQ67 (Fig. 5a and 
Supplementary Fig. 21). In addition, loss of rpn-6.1 had a detrimental 
effect on the motility phenotype of polyQ67 worms even at early (day 3) 
adulthood stages (Fig. 5b and Supplementary Fig. 22). Furthermore, 

Figure 4 | rpn-6.1 is a determinant of stress resistance and viability. a, rpn- 
6.1 overexpressing (OE) worms live longer than controls under oxidative stress 
(log rank, P < 0.0001, GFP(OE): mean = 2.97 ± 0.10, n = 43/60; rpn-6.1, 
GFP(OE): mean = 4.89 ± 0.17, n = 40/59). b, rpn-6.1(OE) worms live longer 
than controls under heat stress conditions (34 °C) (log rank, P < 0.0001, 
GFP(OE): mean = 16.20 ± 0.47, n = 60/60; rpn-6.1, GFP(OE): 
mean = 29.83 ± 0.99, n = 60/60). c, Ultraviolet stress assay on rpn-6.1(OE) 
worms (log rank, P = 0.06, GFP(OE): mean = 6.01 ± 0.27, n = 59/60; rpn-6.1, 
GFP(OE): mean = 6.57 ± 0.19, n = 58/59). d, rpn-6.1(OE) does not affect 
lifespan at 20 °C (log rank, P = 0.15, GFP(OE): mean = 16.33 ± 0.34, n = 90/100; 
rpn-6.1, GFP(OE): mean = 17.05 ± 0.34, n = 95/100). rpn-6.1 
overexpression extends lifespan at 25 °C (log rank, P < 0.0001, GFP(OE): 
mean = 10.19 ± 0.32, n = 109/120; rpn-6.1, GFP(OE): mean = 12.73 ± 0.29, 


we observed by filter trap analysis that rpn-6.1(OE) reduced polyQ 
aggregate levels whereas polyQ67 total protein levels remained con- 
stant (Fig. 5c), suggesting that rpn-6.1 specifically reduces aggregated, 
but not soluble, polyQ proteins. 


Discussion 


A growing body of evidence suggests that the protective modulation 
of various nodes of the proteostasis network, including the heat-shock 
response and autophagy, can contribute to the extended lifespan 
caused by the IIS***’, diet restriction*® and germline-signalling 
pathways”. We report evidence for the requirement of an upregulated 
proteasome activity in the extended lifespan of germline-deficient 
nematodes. Our initial analysis of proteasome activity among differ- 
ent longevity models in the worm reveals that only glp-1 mutant and 
diet-restricted animals share an increased proteasome activity, and we 
hypothesize that these animals may share a strategy in which 
resources are actively re-allocated from the germ line to the soma, 
resulting in an enhanced protection of the proteome within somatic 
cells. Furthermore, we find distinct differences in proteasome
activity between glp-1 and daf-2 mutant animals that are mediated by
DAF-16, and in part by KRI-1, DAF-12 and DAF-9, confirming
previous genetic suggestions that DAF-16 activity is differentially
regulated between glp-1 and daf-2 mutants. Mechanistically, in
germline-deficient animals, rpn-6.1 and subsequent increases in 
proteasome activity seem to be direct downstream targets of DAF- 
16/FOXO. Our results thus provide new insights into proteostasis 
regulation and provide a link between the longevity regulator DAF- 
16 and proteasome activity regulation upon rpn-6.1 expression. 

We further define RPN-6 as a potent factor to increase resistance to 
proteotoxic stress, as its upregulation can delay the deleterious effects of 
strong adverse conditions. It is intriguing to speculate that one method 
to ensure survival of the soma may be the direct activation of FOXO/ 
daf-16, under limited nutrient availability or loss of the germ line, 
resulting in increased rpn-6.1 levels and increased proteome mainten- 
ance. Recently, it has been reported that changes in the proteasome may 
explain why ageing is a risk factor for neurodegenerative diseases such 
n = 113/120). e, daf-16 RNAi (starting at day 1 of adulthood) blocks lifespan
extension induced by rpn-6.1 overexpression (rpn-6.1, GFP(OE) fed vector
RNAi bacteria versus rpn-6.1, GFP(OE) fed daf-16 RNAi bacteria, log rank,
P < 0.0001). GFP(OE) fed vector RNAi bacteria: mean = 11.36 ± 0.45, n = 86/100;
GFP(OE) fed daf-16 RNAi bacteria: mean = 10.59 ± 0.36, n = 77/100;
rpn-6.1, GFP(OE) fed vector RNAi bacteria: mean = 13.27 ± 0.37, n = 87/100;
rpn-6.1, GFP(OE) fed daf-16 RNAi bacteria: mean = 10.59 ± 0.28, n = 88/100.
f, hsf-1-RNAi-treated rpn-6.1(OE) worms were long-lived compared to
controls (log rank, P < 0.0001). RNAi was initiated at day 1 of adulthood.
GFP(OE) fed hsf-1 RNAi bacteria: mean = 7.74 ± 0.09, n = 93/99; rpn-6.1,
GFP(OE) fed hsf-1 RNAi bacteria: mean = 9.92 ± 0.13, n = 94/100. See
Supplementary Table 5 for statistical analysis and replicate data of stress assays
and lifespan experiments.
Figure 5 | rpn-6.1 protects from polyglutamine aggregation. a, rpn-6.1
improves motility in polyQ67 worms. Bar graphs represent average (±s.e.m.)
thrashing over a 30-s period on day 1 (P < 0.0001, GFP(OE);Q67 (n = 40),
rpn-6.1,GFP(OE);Q67 (n = 41)), day 3 (P < 0.0001, GFP(OE);Q67 (n = 40),
rpn-6.1,GFP(OE);Q67 (n = 44)) and day 5 (P < 0.05, GFP(OE);Q67 (n = 31),
rpn-6.1,GFP(OE);Q67 (n = 39)) of adulthood. b, Loss of rpn-6.1 has a detrimental
effect on the motility defects of polyQ67 worms. Bar graphs represent average
(±s.e.m.) thrashing over a 30-s period on day 1 (P = 0.15, Q67 fed vector RNAi
bacteria (n = 58), Q67 fed rpn-6.1 RNAi bacteria (n = 30)), day 3 (P < 0.0001,
Q67 fed vector RNAi bacteria (n = 44), Q67 fed rpn-6.1 RNAi bacteria (n = 51))
and day 5 (P < 0.0001, Q67 fed vector RNAi bacteria (n = 41), Q67 fed rpn-6.1
RNAi bacteria (n = 47)) of adulthood. All statistical comparisons were made by
Student’s t-test for unpaired samples. c, Filter trap analysis indicates that rpn-6.1
overexpression results in reduced polyQ aggregates (detected by anti-GFP
antibody). Right panel: SDS-PAGE analysis with antibodies to GFP, RPN-6 and
α-tubulin loading control. *P < 0.05; **P < 0.0001.
as Alzheimer’s, Parkinson’s and Huntington’s disease*’. Therefore,
RPN-6 may be a powerful candidate to correct deficiencies in disorders
associated with a failure in protein homeostasis. It will be of crucial 
interest to explore in mammalian models whether RPN-6 could indeed 
alleviate the associated symptoms of these disorders. 


METHODS SUMMARY 


Caenorhabditis elegans were cultured using standard techniques*' and fed on 
Escherichia coli OP50 or HT115 containing a double-stranded-RNA-expressing 
plasmid”. 
26S proteasome activity assays. In vitro 26S proteasome activity assays were
performed as previously described”. Worms were lysed in proteasome activity
assay buffer (50 mM Tris-HCl, pH 7.5, 250 mM sucrose, 5 mM MgCl₂, 0.5 mM
EDTA, 2 mM ATP and 1 mM dithiothreitol) using a Precellys 24 homogenizer
(Bertin technologies). Lysate was centrifuged at 10,000g for 15 min at 4 °C. A total
of 25 µg of total protein lysate was transferred to a 96-well microtitre plate (BD
Falcon) and incubated with fluorogenic substrate. Fluorescence (380-nm excitation,
460-nm emission) was monitored on a microplate fluorometer (Infinite
M1000, Tecan) every 5 min for 1 h at 25 °C.
Motility assay. Thrashing rate was determined as previously described**. Worms 
were transferred to a drop of M9 buffer and after 30 s of adaptation the number of 
body bends was counted for 30s. A body bend was defined as a change in 
direction of the bend at the midbody of a nematode’. 
Filter trap. Worm extracts were generated by glass bead disruption on ice in
non-denaturing lysis buffer (50 mM HEPES, pH 7.4, 150 mM NaCl, 1 mM EDTA,
0.5% Triton X-100) supplemented with EDTA-free protease inhibitor cocktail
(Roche). Lysate was centrifuged at 5,000g for 5 min. A total of 70 µg of protein
extract was supplemented with SDS at a final concentration of 1% and loaded 
onto a cellulose acetate membrane assembled in a slot blot apparatus (BioRad). 
The membrane was washed with 0.1% SDS and retained Q67-GFP was assessed 
by immunoblotting for GFP (Roche). 

A detailed description of all experimental methods including C. elegans strains, 
growth, imaging, lifespan analysis, stress assays and RNAi application is provided 
in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Caenorhabditis elegans strains and generation of transgenic lines. CF512
(fer-15(b26)II;fem-1(hc17)IV), CB4037 (glp-1(e2141)II), AU147
(daf-16(mgDf47)I;glp-1(e2141)II), CF1880 (daf-16(mu86)I;glp-1(e2141)II), DA1116
(eat-2(ad1116)II) and wild-type (N2) C. elegans strains were obtained from the
Caenorhabditis Genetics Center. AGD151 (eat-2(ad1116)II;fer-15(b26)II;fem-1(hc17)IV)
was generated by crossing CF512 with DA1116 (eat-2(ad1116)II).
CF596 (daf-2(mu150)III;fer-15(b26)II;fem-1(hc17)IV) was a gift from C.
Kenyon. C. elegans were handled using standard methods*'.

For the generation of worm strains AGD597-AGD598 (N2, uthEx556[psur5::rpn-6.1,
pmyo3::GFP] and N2, uthEx557[psur5::rpn-6.1, pmyo3::GFP]), a DNA plasmid
mixture containing 75 ng µl⁻¹ pDV1 (psur5::rpn-6.1) and 20 ng µl⁻¹ pPD93_97
(pmyo-3::GFP) was injected into the gonads of adult N2 hermaphrodite animals,
using standard methods“. GFP-positive F1 progeny were selected. Individual F2
worms were isolated to establish independent lines. Control worms (AGD614) used
in experiments with AGD597-AGD598 were generated by microinjecting N2 worms
with 20 ng µl⁻¹ pPD93_97 (pmyo-3::GFP). AGD886 (fer-15(b26)II;fem-1(hc17)IV;
uthEx557[psur5::rpn-6.1, pmyo3::GFP]) was generated by crossing AGD598 to CF512.
Control strain AGD885 (fer-15(b26)II;fem-1(hc17)IV; uthEx633[myo3p::GFP])
was generated by crossing AGD614 to CF512.

Both AM101 (rmIs110[pF25B3.3::Q40::YFP]) and AM716
(rmIs284[pF25B3.3::Q67::YFP]) were a gift from R. I. Morimoto. For the generation of
worm strains AGD850 (rmIs110[pF25B3.3::Q40::YFP];uthEx557[psur5::rpn-6.1,pmyo3::GFP])
and AGD851 (rmIs284[pF25B3.3::Q67::YFP];uthEx557[psur5p::rpn-6.1,pmyo3::GFP]),
AGD598 strain was crossed to AM101 and
AM716, respectively. Control strains AGD866 (rmIs110[pF25B3.3::Q40::YFP];
uthEx633[pmyo3::GFP]) and AGD867 (rmIs284[pF25B3.3::Q67::YFP];uthEx633[pmyo3::GFP])
were generated by crossing AGD614 to AM101 and AM716,
respectively.

For the generation of worm strains AGD945-AGD946 (N2, uthEx649[rpn-6p::tdTomato,
pRF4(rol-6)] and N2, uthEx650[rpn-6p::tdTomato, pRF4(rol-6)]), a
DNA plasmid mixture containing 75 ng µl⁻¹ pDV2 (rpn-6p::tdTomato) and
20 ng µl⁻¹ pRF4(rol-6) was injected into the gonads of adult N2 hermaphrodite
animals. Roller-phenotype-positive F1 progeny were selected. Individual F2
worms were isolated to establish independent lines. AGD1047 (glp-1(e2141)II;
uthEx649[rpn-6p::tdTomato, pRF4(rol-6)]) was generated by crossing AGD945 to
CB4037. AGD1048 (daf-16(mu86)I;glp-1(e2141)II; uthEx649[rpn-6p::tdTomato,
pRF4(rol-6)]) was generated by crossing AGD945 to CF1880.

YD1 (N2, xzEx1[Punc-54::Dendra2]) and YD3 (N2, xzEx3[Punc- 
54::UbG76V::Dendra2]) were a gift from C. I. Holmberg. AGD1032 (glp- 
1(e2141)III; xzEx1[Punc-54::Dendra2]) was generated by crossing YD1 to 
CB4037. AGD1033 (glp-1(e2141)II; xzEx3[Punc-54::UbG76V::Dendra2]) was 
generated by crossing YD3 to CB4037. AGD1036 (fer-15(b26)II;fem-1(hc17)IV; 
xzEx1[Punc-54::Dendra2]) was generated by crossing YD1 to CF512. AGD1037 
(fer-15(b26)II;fem-1(hc17)IV; xzEx3[Punc-54::UbG76V::Dendra2]) was generated
by crossing YD3 to CF512.

Construction of rpn-6.1 expression construct. To construct pDV1, the rpn-6.1 
C. elegans expression plasmid pPD95.77 from the Fire Lab kit was digested with
SphI and XmaI to insert 3.6 kilobases of the sur5 promoter. The resultant vector
was then digested with KpnI and EcoRI to excise GFP and insert a multi-cloning
site containing KpnI, NheI, NotI, XbaI and EcoRI. F57B9.10.A (rpn-6.1) was PCR
amplified from cDNA to include 5′ XmaI and 3′ XbaI restriction sites then cloned
into the aforementioned vector. All constructs were sequence verified. 
Construction of rpn-6.1 transcriptional reporter construct. To construct 
pDV2, pPD95.77 from the Fire Lab kit was digested to replace GFP with 
tdTomato. The promoter region and first intron of F57B9.10.A (rpn-6.1) was 
PCR amplified from N2 gDNA to include −363 to +1012 then cloned into the
aforementioned vector using SalI and BamHI. The construct includes 46 nucleo- 
tides of exon 1. Construct was sequence verified. 

RNAi constructs. RNAi-treated strains were fed E. coli (HT115) containing an 
empty control vector (L4440) or expressing double-stranded RNAi®. daf-12, rpn- 
2, rpn-6.1, rpn-11 and skn-1 RNAi constructs used were taken from the Vidal 
RNAi library. cco-1, rpn-1, nhr-80, daf-9, hsf-1 and kri-1 RNAi constructs used 
were from the Ahringer RNAi library. pAD43, the daf-16 RNAi construct, was 
previously described’’. See Supplementary Table 6 for further details about 
double-stranded RNAi constructs used for knockdown assays. 

Lifespan studies. Lifespan analyses were performed as described previously’. 
Worms were synchronized by egg laying for 2 h. Animals were grown at 20 °C
until day 1 of adulthood. One-hundred animals were used per condition and 
scored every day or every other day. Lifespans were conducted at either 20 °C or 
25 °C as stated in the figure legends. For non-integrated lines AGD597, AGD598 
and AGD886, GFP-positive worms were selected for lifespan studies. JMP IN 8 
software was used for statistical analysis to determine means and percentiles. In 
all cases, P values were calculated using the log-rank (Mantel-Cox) method. See
Supplementary Table 5 for statistical analysis and replicate data. 
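
For readers unfamiliar with the log-rank (Mantel-Cox) comparison named above, the following is a minimal Python sketch of the calculation. It is not the JMP analysis used in this study; the function name `logrank_test`, the input layout (day of death or censoring plus an event flag per animal) and the toy values in the usage note are assumptions made purely for illustration.

```python
import math
from collections import Counter


def logrank_test(times_a, events_a, times_b, events_b):
    """Two-group log-rank (Mantel-Cox) test.

    times_*  : day on which each animal died or was censored
    events_* : 1 if the animal died on that day, 0 if it was censored
    Returns (chi-square statistic, P value) for 1 degree of freedom.
    """
    deaths_a = Counter(t for t, e in zip(times_a, events_a) if e)
    deaths_b = Counter(t for t, e in zip(times_b, events_b) if e)

    observed_minus_expected = 0.0
    variance = 0.0
    # Walk through every distinct day on which at least one death occurred.
    for t in sorted(set(deaths_a) | set(deaths_b)):
        at_risk_a = sum(1 for x in times_a if x >= t)
        at_risk_b = sum(1 for x in times_b if x >= t)
        n = at_risk_a + at_risk_b
        d = deaths_a[t] + deaths_b[t]
        # Expected deaths in group A if both groups shared one hazard.
        observed_minus_expected += deaths_a[t] - d * at_risk_a / n
        if n > 1:
            # Hypergeometric variance of the number of deaths in group A.
            variance += d * (at_risk_a / n) * (at_risk_b / n) * (n - d) / (n - 1)

    if variance == 0:
        return 0.0, 1.0
    chi2 = observed_minus_expected ** 2 / variance
    # Upper tail of the chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

A call such as `logrank_test([2, 3, 3, 4, 4, 5], [1] * 6, [6, 7, 7, 8, 8, 9], [1] * 6)` compares two hypothetical groups of six animals; censored animals would be entered with an event flag of 0.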

Stress assays. For heat-shock assays, eggs were transferred to plates seeded 
with E. coli (OP50) bacteria and grown to day 1 of adulthood at 20°C. Worms 
were then transferred to fresh plates and heat shocked at 34°C. Worms were 
checked every hour for viability. Paraquat assays were performed as previously 
described”. Briefly, day-1 adults were transferred to plates containing 7.5 mM
paraquat and cultured at 25 °C. Worms were checked every day for viability. For
UV irradiation assays**, day-5 adult worms were transferred to plates without
OP50 and exposed to 1,200 J m⁻² of UV using a UV Stratalinker. Worms were
transferred back to fresh plates seeded with E. coli (OP50) and scored daily for 
viability. 

Motility assay. Thrashing rate was determined as previously described”. 
Animals were grown at 20 °C until L4 stage and then grown at 25 °C for the rest 
of the experiment. Worms were fed with E. coli (OP50) bacteria. RNAi-treated 
strains were fed E. coli (HT115) containing an empty control vector (L4440) or 
expressing double-stranded RNAi of the rpn-6.1 gene. Worms were transferred at 
day 1, 3 or 5 of adulthood to a drop of M9 buffer and after 30 s of adaptation the 
number of body bends was counted for 30 s. A body bend was defined as a change in
direction of the bend at the midbody of an animal”. 
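
As a rough illustration of how thrashing counts of this kind can be summarized and compared, the Python sketch below computes the mean and s.e.m. of body-bend counts and an unpaired Student's t statistic with pooled variance. The function names `summarize` and `student_t` are hypothetical and the sketch is not the analysis pipeline used in this study. It returns the t statistic and degrees of freedom only; converting these to a P value requires a t-distribution table or a statistics package.

```python
import math
from statistics import mean, variance


def summarize(bends_per_30s):
    """Return (mean, s.e.m.) of body-bend counts from 30-s observation windows."""
    n = len(bends_per_30s)
    return mean(bends_per_30s), math.sqrt(variance(bends_per_30s) / n)


def student_t(sample_a, sample_b):
    """Unpaired Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    dof = n_a + n_b - 2
    # Pooled variance assumes both groups share the same underlying variance.
    pooled = ((n_a - 1) * variance(sample_a) + (n_b - 1) * variance(sample_b)) / dof
    standard_error = math.sqrt(pooled * (1 / n_a + 1 / n_b))
    return (mean(sample_a) - mean(sample_b)) / standard_error, dof
```

Each input list holds one count per animal, so a group of 40 worms would be passed as a list of 40 integers.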

26S proteasome fluorogenic peptidase assays. In vitro 26S proteasome activity 
assays were performed as previously described’’. Briefly, worms were lysed in 
proteasome activity assay buffer (50 mM Tris-HCl, pH 7.5, 250 mM sucrose,
5 mM MgCl₂, 0.5 mM EDTA, 2 mM ATP and 1 mM dithiothreitol) using a
Precellys 24 homogenizer (Bertin technologies). Lysate was centrifuged at
10,000g for 15 min at 4 °C. For each experiment, 25 µg of total protein lysate
was transferred to a 96-well microtitre plate (BD Falcon) then fluorogenic 
substrate was added. For measuring the chymotrypsin-like activity of the 
proteasome either Z-Gly-Gly-Leu-AMC (Enzo) or Suc-Leu-Leu-Val-Tyr-AMC 
(Enzo) was used. Z-Leu-Leu-Glu-AMC (Enzo) was used to measure the caspase- 
like activity of the proteasome and Ac-Arg-Leu-Arg-AMC for the proteasome 
trypsin-like activity. Fluorescence (380-nm excitation, 460-nm emission) was 
monitored on a microplate fluorometer (Infinite M1000, Tecan) every 5 min 
for 1 h at 25 °C.
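
One common way to reduce a kinetic read of this kind to a single number is to fit a straight line to fluorescence against time and normalize the slope to the protein input. The text does not state how the readings were converted to activity, so the Python sketch below is an assumption offered only as an example; the function name `activity_slope` and its defaults (5-min read interval, 25 µg of lysate) are illustrative choices taken from the assay description.

```python
def activity_slope(fluorescence, interval_min=5.0, protein_ug=25.0):
    """Least-squares slope of fluorescence versus time, per microgram of protein.

    fluorescence : readings in arbitrary units, taken every `interval_min` minutes
    Returns the slope in arbitrary units per minute per microgram of lysate.
    """
    n = len(fluorescence)
    if n < 2:
        raise ValueError("at least two readings are needed to fit a slope")
    times = [i * interval_min for i in range(n)]
    t_mean = sum(times) / n
    f_mean = sum(fluorescence) / n
    # Ordinary least-squares slope: covariance(t, f) / variance(t).
    sxy = sum((t - t_mean) * (f - f_mean) for t, f in zip(times, fluorescence))
    sxx = sum((t - t_mean) ** 2 for t in times)
    return sxy / sxx / protein_ug
```

A 1-h read at 5-min intervals gives 13 readings per well, including the reading at time zero.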

Western blot. For each strain, 2,000 adult worms were collected in proteasome 
assay activity buffer supplemented with protease inhibitors (Roche) and lysed 
using a Precellys 24 homogenizer. Lysate was centrifuged at 10,000g for 15 min at 
4 °C. 40 µg of total protein was resolved by SDS-PAGE and transferred to
nitrocellulose membrane. Western blot analysis was performed with anti-20S alpha
1-7 (Abcam), anti-proteasome 20S C2 (Abcam), anti-Rpt6 (Enzo), anti-Rpt5
(Enzo), anti-PSMD7 (Abcam), anti-Rpn2 (Abcam), anti-PSMD11 (Novus),
anti-FK1 (Enzo), GFP (Roche), anti-α-tubulin (Sigma) and anti-β-actin
(Abcam). 

Filter trap. Animals were grown at 20 °C until L4 stage and then grown at 25 °C 
for the rest of the experiment. Day 1 adult worms were collected with M9 buffer 
and worm pellets were frozen with liquid N2. Frozen worm pellets were thawed 
on ice and worm extracts were generated by glass bead disruption on ice in
non-denaturing lysis buffer (50 mM HEPES, pH 7.4, 150 mM NaCl, 1 mM EDTA,
0.5% Triton X-100) supplemented with EDTA-free protease inhibitor cocktail
(Roche). Worm and cellular debris were removed with a 5,000g spin for 5 min.
Approximately 70 µg of protein extract was supplemented with SDS at a final
concentration of 1% and loaded onto a cellulose acetate membrane assembled in a 
slot blot apparatus (BioRad). The membrane was washed with 0.1% SDS and 
retained Q67-GFP was assessed by immunoblotting for GFP (Roche). Extracts 
were also analysed by SDS-PAGE to determine protein expression levels. 
Microscopy, image analysis, equipment and settings. Newly hatched 
larvae were grown at 25°C until day 3 of adulthood. These young adults were 
mounted at room temperature (20-23°C) on a 10% agarose pad on glass 
slides with 1 µl of M9, covered with a cover slip. For imaging, a Zeiss Axiovert
microscope and AxioCam with AxioVision Rel. 4.7 software were used. Images
of whole worms were acquired with ×10, 0.45 numerical aperture (NA)
plan-apochromat objectives. Photoconversion was carried out using a 405-nm 
filter and an EXFO X-Cite 120Q metal halide lamp with 100% output for 60s. 
Worms were imaged before and after photoconversion, and then were 
recovered on feeding plates at 20°C. After 24h, photoconverted worms were 
imaged with the same setting. Fluorescence intensities were analysed with 
AxioVision Rel. 4.7. 

RNA isolation and quantitative RT-PCR. Total RNA was isolated from syn- 
chronized populations of approximately 2,000 day-5 adults. Total RNA was 
extracted using TRIzol reagent (GIBCO). cDNA was generated using 
Quantitect Reverse Transcriptase kit (Qiagen). SybrGreen real-time qPCR 
experiments were performed with a 1:20 dilution of cDNA using an ABI 
Prism 7900HT (Applied Biosystems) following the manufacturer’s instructions.


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 
HDAC8 mutations in Cornelia de Lange syndrome
affect the cohesin acetylation cycle 
Cornelia de Lange syndrome (CdLS) is a dominantly inherited congenital
malformation disorder, caused by mutations in the cohesin-
loading protein NIPBL'” for nearly 60% of individuals with classical 
CdLS**, and by mutations in the core cohesin components SMC1A 
(~5%) and SMC3 (<1%) for a smaller fraction of probands®”’. In 
humans, the multisubunit complex cohesin is made up of SMC1, 
SMC3, RAD21 and a STAG protein. These form a ring structure 
that is proposed to encircle sister chromatids to mediate sister 
chromatid cohesion® and also has key roles in gene regulation’. 
SMC3 is acetylated during S-phase to establish cohesiveness of 
chromatin-loaded cohesin’®"'’, and in yeast, the class I histone 
deacetylase Hos1 deacetylates SMC3 during anaphase’*®. Here
we identify HDAC8 as the vertebrate SMC3 deacetylase, as well as
loss-of-function HDAC8 mutations in six CdLS probands. Loss
of HDAC8 activity results in increased SMC3 acetylation and
inefficient dissolution of the ‘used’ cohesin complex released from 
chromatin in both prophase and anaphase. SMC3 with retained 
acetylation is loaded onto chromatin, and chromatin immunopre- 
cipitation sequencing analysis demonstrates decreased occupancy 
of cohesin localization sites that results in a consistent pattern of 
altered transcription seen in CdLS cell lines with either NIPBL or 
HDAC8 mutations. 

Human SMC3 is acetylated by ESCO1 and ESCO2, homologues of
yeast Eco1, and this acetylation has been shown to be important for the establishment
of sister chromatid cohesion'’®'"*'”!*. Using a monoclonal antibody
specific for acetylated SMC3 (SMC3-ac)’*, we found that whereas total 
SMC3 levels remain stable throughout the cell cycle, SMC3-ac rapidly 
disappears during mitosis, suggesting coordinated deacetylation 
(Supplementary Fig. 1). 

We therefore used RNA interference (RNAi)-based screening of all 
known human histone deacetylases (HDACs) and sirtuins to identify 
HDAC8 as the vertebrate SMC3 deacetylase (Supplementary Fig. 2).
Loss of HDAC8 activity using either HDAC8 RNAi or the HDAC8-
specific inhibitor PCI-34051 (PCI; Fig. 1a, b) does not alter cell cycle 
progression, but clearly increases SMC3-ac in both soluble and 
chromatin fractions throughout the cell cycle (Fig. 1c, lanes 4 and 6,
Fig. 1e, lanes 18-22 and 29-33, and Supplementary Fig. 3d, lanes
22-28 and 36-42). Nearly all HDAC8 is present in the soluble fraction
in both asynchronous and synchronized cultures (Fig. 1c, e). These 
data indicate that HDAC8 is present and active throughout the cell 
cycle, and that soluble SMC3-ac is its deacetylation target, similar to 
Hos1 in yeast'*"'*. Notably, the increase of SMC3-ac in the soluble
fraction in the absence of HDAC8 activity suggests that SMC3-ac 
dissociates from chromatin but fails to be deacetylated. In addition, 
we unexpectedly observed few sister chromatid cohesion defects with 
loss of HDAC8 activity alone (Supplementary Fig. 4).

To understand the role of HDAC8 in genome-wide regulation of
cohesin dynamics, we performed chromatin immunoprecipitation 
sequencing (ChIP-Seq) analysis of synchronized HeLa cells transfected 
with control or HDAC8 RNAi (Fig. 2) and immunoprecipitated with 
either an anti-RAD21 antibody to detect total cohesin or an anti- 
SMC3-ac antibody. Although total cellular cohesin shows no decrease 
(Supplementary Fig. 5a, b), and there is a high degree of overlap 
between SMC3-ac, cohesin and CTCF” localization sites in treated 
and untreated cells, high read numbers and tight correlations between 
experimental replicates enabled us to identify a 17% loss of total 
cohesin localization peaks with reduced HDAC8 activity (Fig. 2a—c 
and Supplementary Fig. 5d-h). Furthermore, despite using conditions 
that increase total SMC3-ac more than twofold (Fig. 1e and
Supplementary Fig. 3d), we note a 16% loss of SMC3-ac localization 
sites with HDAC8 reduction (Fig. 2a-d and Supplementary Fig. 5f, g). 
Finally, we found that in both control and HDAC8-depleted cells, 
SMC3-ac preferentially localizes to downstream regions of genes 
relative to the distribution of RAD21 (Fig. 2c, d and Supplementary 
Fig. 5f, g). Together, these data demonstrate decreased occupancy of
cohesin localization sites with the loss of HDAC8 activity, an obser- 
vation similarly noted for NIPBL haploinsufficient CdLS cells”. 

With the known role of cohesin regulation in CdLS and the observations
that a reduction in either HDAC8 or NIPBL leads to decreased
cohesin occupancy of localization sites, we proposed that HDAC8 
mutations may cause CdLS. We screened this X-linked gene in 154 
individuals with CdLS negative for mutations in NIPBL, SMC1A and 
Figure 1 | HDAC8 is an SMC3 deacetylase. a, HeLa cells were transfected
with HDAC8 short interfering RNA (siRNA; lanes 1-4) or incubated with
25 µM PCI, the HDAC8-specific inhibitor (lanes 5-9), for the indicated times.
Total cell lysates were prepared and analysed by immunoblotting using
anti-SMC3-ac, SMC3 and HDAC8 antibodies. Numbers beneath SMC3-ac bands
indicate quantification of SMC3-ac levels normalized to SMC3 levels and the
0 h time point. b, Acetylated SMC3 was prepared by co-immunoprecipitation
with SMC1A from HeLa cell chromatin extracts. Immunoprecipitates (IP) were
incubated with recombinant purified HDAC8 (rHDAC8) or mutant (mut)
HDAC8 protein at 30 °C for 1 h and analysed by immunoblotting as in a.


SMC3, as well as RAD21 (also known as SCC1), STAG2, ESCO1, ESCO2
and MAU2 (also known as SCC4). We identified four de novo missense
mutations and one de novo nonsense mutation in HDAC8 (Sup- 
plementary Table 1 and Fig. 3a). In addition, one familial mutation 
(c.1001A>G; p.H334R) was identified in a boy, his mildly affected
sister and his unaffected mother, in whose blood the mutant allele was
inactivated. This mutation was also one of the de novo
mutations in an unrelated girl. None of the mutations was seen in 290 
ethnically matched control chromosomes or in 629 individuals of the 
1000 Genomes Project*!. Despite the small numbers and the varied 
clinical features in females due to random X-chromosome inactiva- 
tion, these children demonstrate growth, cognitive and facial features 
consistent with those caused by mutations in NIPBL (‘classical’ CdLS). 
Both expression studies (Supplementary Fig. 6a) and X-chromosome 
inactivation studies (data not shown) demonstrate complete skewing 
towards the normal allele in the blood of females with HDAC8 muta- 
tions, indicating strong selection against the mutation. This limited 
the number of cell lines available for subsequent studies. However, 
immunoblotting demonstrated minimal HDAC8 protein expression in
lymphoblastoid cell lines (LCLs) from a hemizygous boy with a p.G320R 
mutation as well as the skin fibroblasts from a female lyonized to express 
a p.H180R mutant allele (Fig. 3b), indicating protein instability in each 
case. Consistent with this, assessment of SMC3-ac demonstrates 
increased levels in both the p.G320R LCLs and the p.H180R fibroblasts, 
whereas the total amount of SMC3 (Fig. 3b) and cell cycle distribution for 
the LCLs are unchanged (Supplementary Fig. 6b). 

We expressed the HDAC8 missense mutations in Escherichia coli, 
purified and assayed each for deacetylase activity. These data (Fig. 3c) 
demonstrate that the p.H180R mutation severely abrogates HDAC8 
deacetylase activity, and demonstrate significant activity losses for 


314 | NATURE | VOL 489 | 13 SEPTEMBER 2012 


Chr., chromatin; Sup., supernatant; WT, wild type. c, Unsynchronized HeLa 
cells transfected with HDAC8 siRNA for 48 h were fractionated into soluble and 
chromatin fractions and immunoblotted as in a. d, HeLa cells were 
synchronized by double thymidine arrest, and released in the presence or 
absence of 25 µM PCI for the indicated time and verified by FACS analysis for
cell cycle progression. e, Cell extracts of the PCI-treated cells in d were 
fractionated into soluble and chromatin fractions and analysed by 
immunoblotting as in a. Histone H3 Ser 10 phosphorylation (H3S10P) is a 
marker of prophase and the onset of mitosis. 


the p.G320R, p.T311M and p.H334R mutations, consistent with high 
conservation of each residue and position in the structure of HDAC8- 
substrate complexes**”* (Fig. 3d and Supplementary Fig. 6c-e). Finally, 
purified wild-type, but not mutant HDAC8, can rescue the SMC3 
overacetylation seen in HDAC8 mutant LCLs (Supplementary 
Fig. 6g, h). 

We have previously demonstrated that mutations in NIPBL, the 
primary cause of CdLS, result in consistent, reproducible changes of 
genes expressed from LCLs derived from individuals with CdLS”. 
Using Nanostring multiplex expression analysis to avoid variation 
introduced by PCR or RNA amplification-based assessments and a 
previously validated 32-gene CdLS classifier set”, we compared the 
expression of LCLs from the male with the p.G320R mutation and 
from two girls (with the p.H180R and p.H334R mutations) who 
express only the normal allele in these cells, to 10 normal controls 
and 12 LCLs with loss of function NIPBL mutations. In this assay, 
the aggregate 32-gene LCL expression profile of the single male with 
an HDAC8 mutation strongly correlates with that seen in NIPBL- 
mutant cell lines, whereas the profile for the two female lines, which 
express the wild-type allele, correlates with normal controls (Fig. 3e). 
These data support the hypothesis that loss of HDAC8 activity results 
in widespread transcriptional dysregulation as seen in NIPBL-mutated 
CdLS cells. The loss of cohesin-binding sites in HDAC8 RNAi- 
depleted HeLa cells and in NIPBL-mutant LCLs”° was also noted in 
the p.G320R HDAC8-mutant LCLs and fibroblasts derived from the 
girl with the p.H180R mutation, in which small but consistent changes 
in transcriptional dysregulation were also noted (Supplementary Figs 
6a, 7 and 8). Together, these data suggest that loss of HDAC8 activity 
leads to a common pathogenic mechanism for classical CdLS that 
converges on the reduction of bound cohesin complexes. 
Figure 2 | Cohesin and SMC3-ac localization sites in control and HDAC8
RNAi-treated HeLa cells. a, Example of ChIP-Seq data (Ensembl gene
position 1.68-2.10 Mb of human chromosome 12). ChIP-Seq data are shown in
reads per million. Regions in which signals were significantly enriched (see 
Methods) are in red. Binding profiles for RAD21 and SMC3-ac in control and 
HDAC8 RNAi-treated HeLa cells are shown. b, Venn diagrams showing 
numbers and overlap of localization sites. c, Example of region-specific 
localization showing binding sites for RAD21 and SMC3-ac in the MYC gene 
locus. Ensembl gene position 128.728-128.773 Mb of human chromosome 8 is
shown. d, Classification of RAD21 and SMC3-ac localization sites as a log ratio 
to background reads. Significance was calculated using a two-sample test for 
equality of proportions. *P < 0.05; **P < 0.001; ***P < 0.0001; —P> 0.05. 
Upstream and downstream are defined as within 5 kb from gene boundaries, 
respectively. SMC3-ac preferentially localizes to the downstream end of genes. 
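
The two-sample test for equality of proportions mentioned in this legend can be sketched in Python as a pooled z-test. This is a simplified illustration and not necessarily the exact procedure used for Fig. 2d (for example, no continuity correction is applied); the function name `two_proportion_test` and the counts in the usage note are hypothetical.

```python
import math


def two_proportion_test(hits_a, total_a, hits_b, total_b):
    """Two-sided z-test for equality of two proportions (pooled standard error).

    hits_*  : number of sites falling in the category of interest
    total_* : total number of sites in each data set
    Returns (z statistic, two-sided P value).
    """
    p_a = hits_a / total_a
    p_b = hits_b / total_b
    pooled = (hits_a + hits_b) / (total_a + total_b)
    standard_error = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if standard_error == 0:
        # Both proportions are 0 or 1, so there is no evidence of a difference.
        return 0.0, 1.0
    z = (p_a - p_b) / standard_error
    # Two-sided tail probability of the standard normal distribution.
    return z, math.erfc(abs(z) / math.sqrt(2))
```

For instance, `two_proportion_test(30, 100, 10, 100)` asks whether 30 of 100 sites in one data set and 10 of 100 in another could plausibly reflect the same underlying fraction.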


To understand the endogenous effect of HDAC8 loss on cohesin 
further, we compared each cohesin subunit in chromatin-bound and 
unbound fractions in the LCLs with a hemizygous p.G320R mutation 
versus control LCLs, and unexpectedly noted accumulation of RAD21 
fragments in HDAC8-mutant cells (Supplementary Fig. 9a, lanes 2, 4 
and 6). Using a RAD21 amino terminal-specific antibody, we verified 
accumulation of the RAD21 N-terminal separase cleavage fragment 
(RAD21-N) in mutant LCLs, HDAC8 RNAi-treated HeLa cells and 
HDAC8-mutated fibroblasts expressing catalytically inactive HDAC8 
compared with wild-type HDAC8 (Fig. 4 and Supplementary Fig. 9). 
The accumulation of this cleavage fragment in the chromatin- 
bound fraction of HDAC8-mutant cells suggests that it may remain 
bound to the cohesin complex, so we assessed whether RAD21-N 
co-immunoprecipitates with other cohesin subunits. In HDAC8- 
mutant cells, RAD21-N co-immunoprecipitates with SMC1A, SMC3, 
STAG1 and STAG2 (Fig. 4b), strongly suggesting that without functional
HDAC8, the cleaved N terminus of RAD21 remains attached to
the cohesin complex, presumably via acetylated SMC3 (ref. 24). To 
confirm that clearance of cleaved RAD21 is dependent on HDAC8, we 
performed complementary analyses in synchronized HeLa cells with 
or without the HDAC8 inhibitor PCI (Fig. 4c, d). These data demonstrate
the appearance of the N-terminal and carboxy-terminal RAD21
fragments after entry into mitosis (marked by histone H3 Ser 10 
phosphorylation; H3S10P) and activation of separase (Fig. 4d, lanes 
9-12 and 21-24) as well as their persistence in both soluble and 
chromatin fractions (Fig. 4d, lanes 18-20) when most cells are in G1 
Figure 3 | HDAC8 mutations in CdLS. a, Facial features of individuals with
HDAC8 mutations labelled with corresponding mutation and sex. b, Western
blotting of protein from LCLs and fibroblasts (Fib). Sex is indicated under controls
(1, 2 and 3) or mutation designation. c, HDAC8 mutations disrupt deacetylase
activity. Bar graphs demonstrate the effect of HDAC8 mutations on deacetylase
specific activity (nmol substrate pmol enzyme⁻¹ min⁻¹). All assays were
performed in triplicate. Error bars indicate standard deviation. Enzymatic activity
for all mutations is significantly less than wild type (P < 0.05). d, Localization of
HDAC8 mutations on the crystal structure (PDB accession code 3F06; ref. 23).
Mutated residues are in red. e, The expression profile of HDAC8-mutant LCL is 
consistent with that seen in cell lines with NIPBL mutations. The summed 
standard deviations of gene expression compared with NIPBL mutant from 
control LCLs for 32 genes was determined for each sample. Box graphs 
demonstrate twenty-fifth, median and seventy-fifth percentiles of each group. 
Whiskers indicate minimum and maximum values. Note that for the HDAC8 
male (p.G320R) the values are for biological replicate samples for a single cell line 
(denoted by asterisk). Unpaired two-tailed t-test significances are indicated. 


phase (data not shown). By contrast, in untreated cells, these cleavage 
products are predominantly detected in the soluble fraction (Fig. 4d, 
lanes 2-4), and largely disappear 3h after release. 

We also made several observations about HDAC8 activity in the 
prophase removal of cohesin*®’*. Loss of HDAC8 activity does not 
disrupt the prophase pathway of cohesin removal from chromatin, 
as demonstrated by a consistent reduction of chromatin-bound 
cohesin in the presence of metaphase inhibitors in either control or 
PCI-treated cells (Fig. 4d, lanes 13 and 17 versus 21-24). However,
HDAC8 activity is necessary after prophase removal of cohesin,
because in metaphase inhibitor-treated cells, soluble SMC3-ac is 
markedly increased in the presence of PCI (Fig. 4d, lanes 9-12). 
Finally, with WAPAL depletion, which diminishes prophase removal 
of cohesin'*””*, we noted that SMC3-ac is substantially increased 
(Supplementary Fig. 9e). 

To test whether other cohesin-promoting proteins might remain 
associated with SMC3-ac in the absence of HDAC8 activity, we analysed 
HeLa and patient-derived cells, which also demonstrated increased 
retention of sororin (also known as CDCA5)'*” and RAD21-N, 
but not WAPAL, on soluble SMC3-ac (Fig. 4e-g and Supplementary 
Fig. 9d). Taken together, these data strongly suggest that HDAC8 is
necessary for the deacetylation of SMC3 after its removal from 
chromatin in both prophase and anaphase pathways to enable proper 
Figure 4 | Retention of RAD21-N and sororin on cohesin in the absence of
HDAC8. a, RAD21 separase cleavage sites. Vertical lines indicate cleavage
sites. Grey region indicates the RAD21-N antibody epitope. b, SMC3, STAG1 
and STAG2 were immunoprecipitated from soluble extracts from normal 
(WT) and HDAC8-mutant (H) cell lines. Full-length RAD21 (FL), the cleaved 
N-terminal RAD21 fragment, SMC3-ac and STAG1/2 were analysed by SDS- 
PAGE and immunoblotting. c, Experimental schema of d. HeLa cells were 
synchronized by double thymidine arrest (DTA) and released (R) in the 
presence or absence of PCI. Seven hours after release, cells were treated with 
RO-3306 (RO) to arrest at the G2/M boundary. After removing RO-3306, cells 
were cultured in the presence or absence of nocodazole (Noc) or MG-132 (MG) 
to arrest in metaphase. Cells were collected at indicated time points after release 
from RO-3306 arrest. d, Cell extracts were fractionated into soluble and 
chromatin-bound fractions. Full-length RAD21, cleaved C- and N-terminal 
RAD21 fragments, SMC3-ac, SMC3, H3S10P and α-tubulin were analysed by
SDS-PAGE and immunoblotting. H3S10P is used as a metaphase marker. 

e, Experimental schema of f and g. HeLa cells were synchronized and released 
and treated with RO-3306 as in d. After removing RO-3306, cells were cultured 
in the presence or absence of PCI and collected at the indicated times. f, FACS 
analysis of HeLa cells in g. g, Cell extracts were fractionated into soluble and 
chromatin-bound fractions. Levels of SMC3-ac, RAD21-N, full-length RAD21, 
sororin, WAPAL, cyclin B (a G2 marker) and SMC3 were analysed by SDS- 
PAGE and immunoblotting. 


dissolution of pro-cohesive elements and allow for recycling of 
‘refreshed’ cohesin for the next cell cycle (Supplementary Fig. 10). 

In summary, this work demonstrates that HDAC8 functions as a
vertebrate SMC3 deacetylase to facilitate the renewal of cohesin after 
its removal from chromatin in prophase or anaphase, and that loss of 
HDAC8 activity results in decreased cohesin at localized sites to cause 
both cellular and clinical features of CdLS. 
METHODS SUMMARY


Cell culture and synchronization, chromatin fractionation, purification of recombinant
HDAC8 in baculovirus and E. coli, SMC3-ac deacetylation assays, RNA
interference, immunofluorescence microscopy, human subjects, mutation screen- 
ing, HDAC8 conservation analysis, crystal structure analysis, expression profiling, 
assessment of X-chromosome inactivation, chromatin immunoprecipitation, 
ChIP-Seq and RNA-Seq analyses are described in detail in the Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Antibodies and reagents. Primary antibodies used in this study are as previously 
described for RAD21 (hSCC1)'°, SMC3-ac'® and the RAD21 N terminus”. 
Antibodies used are as follows: SA1, SA2, SMC1A, HDAC6, HDAC10, SIRT4,
SIRT5, SIRT7 were from Abcam; α-tubulin was from Sigma; histone H3(S10P),
HDAC1, HDAC2, HDAC3, HDAC4, HDAC5, SIRT1, SIRT3, SIRT6, SIRT7 and
MAD2L1 were from Cell Signaling Technology; HDAC7, HDAC11 and SIRT2 
were from Millipore; and SMC3 was from Bethyl Laboratories. HDAC8 antibodies 
were purchased from Santa Cruz Biotechnology or derived as mouse monoclonal 
HDAC8 antibody raised against peptide COAYLQHLQKVSQEGDDDHPDS.
PCI-34051 was purchased from Sigma Chemical or synthesized by Synstar Japan. 
TSA, sodium butyrate and nicotinamide were from Wako Chemical. Suberoyl 
anilide bishydroxamide was from Toronto Research Chemicals. Nocodazole was 
from Calbiochem. 

Cell culture and synchronization. HeLa cells were cultured in DMEM 
(Invitrogen) supplemented with 0.2 mM L-glutamine, 100 U ml⁻¹ penicillin,
100 µg ml⁻¹ streptomycin and 10% FCS. LCLs were cultured in RPMI-1640
supplemented with 0.2 mM L-glutamine, 100 U ml⁻¹ penicillin, 100 µg ml⁻¹
streptomycin and 20% FCS. Fibroblasts were cultured in DMEM (Invitrogen)
supplemented with 0.2 mM L-glutamine, 100 U ml⁻¹ penicillin, 100 µg ml⁻¹
streptomycin and 20% FCS. For cell cycle synchronization, HeLa cells were 
synchronized by double thymidine arrest (14-16h in the presence of 2mM 
thymidine, 8h release, 16h in the presence of 2mM thymidine), and collected 
6h and 13h after the second release for enrichment in the G2 and G1 phase, 
respectively. For enrichment in prometaphase of mitosis, nocodazole (330 nM) 
was added 8 h after release from the second thymidine block for 1 or 2h, and cells 
were collected by shake off. 

Protein extraction and immunoprecipitations. To obtain total cell extracts, cells 
were lysed with lysis buffer (20 mM Tris-HCl, pH 8.0, 100 mM NaCl, 10 mM KCl, 
5 mM MgCl₂, 0.2% NP-40, 10% glycerol, cOmplete protease inhibitor cocktail
(Roche), and PhosSTOP phosphatase inhibitor (Roche)) and DNA in the
chromatin fraction was digested by treatment with benzonase (Novagen). To
obtain soluble cell extracts, the cells were lysed with lysis buffer containing 
200 mM NaCl and centrifuged at 20,000g. Immunoprecipitations were performed 
using Protein G magnetic beads conjugated with 5 µg antibodies and incubated
with the cell extracts for 5h at 4°C. After washing with lysis buffer, the beads were 
suspended with SDS-PAGE sample buffer. 

RNA isolation and RT-PCR. Total RNA was isolated using Trizol (Invitrogen) 
and Nucleospin RNA II (Macherey-Nagel) following the manufacturer’s instruc- 
tions. Synthesis of complementary DNA was performed using SuperScript III 
first-strand synthesis system (Invitrogen). cDNA was amplified with KAPA 
SYBR fast qPCR kit (Kapa Biosystems) on an ABI 7500 real time PCR system 
(Applied Biosystems). Primer sequences were: HDAC9 forward, 5'-CTTTGTC
AGGTTCCTGCTGTTCTC-3’, reverse, 5’-TTCCTGTTTCCACAAGGCATTTC-3’;
ACTB forward, 5’-TGGCACCCAGCACAATGAA-3’, reverse, 5’-CTAAGTCA 
TAGTCCGCCTAGAAGCA-3’; and GAPDH forward, 5’-GCACCGTCAAGG 
CTGAGAAC-3’, reverse, 5'-TGGTGAAGACGCCAGTGGA-3’. 
Immunofluorescence microscopy. HeLa cells grown on 18-mm coverslips were 
fixed with 4% paraformaldehyde and permeabilized with PBS containing 0.5% 
Triton X-100 for 10min. After blocking with goat serum in PBS, cells were 
incubated with primary antibodies for SMC3 or SMC3-ac. DNA was stained with 
4',6-diamidino-2-phenylindole (DAPI). 

Chromatin fractionation. Performed as previously described”’. 

Chromosome spreads. Cells treated with nocodazole (100 ng ml⁻¹) were
collected by mitotic shake off and hypotonically swollen in 40% PBS, 60% tap 
water for 5min at room temperature. Cells were fixed with Carnoy’s solution 
(methanol:acetic acid = 3:1), dropped on glass slides and dried. Slides were stained 
with 5% Giemsa (Merck), and washed with water, air-dried and mounted with 
Entellan (Merck). 

Purification of recombinant HDAC8 and SMC3-ac deacetylation assay. Full- 
length cDNA of HDAC8 and the enzymatically dead HDAC8 mutant (D101A/ 
Y306F)*? was subcloned into pFASTBacHT (Invitrogen). The Bac-to-Bac baculovirus 
expression system (Invitrogen) was used to produce recombinant virus according 
to the manufacturer’s protocol. To express recombinant protein, Sf9 cells were 
infected with each baculovirus and incubated for 60h at 27 °C. Total cell lysates 
were prepared using Laemmli SDS buffer and boiling for 5 min. For purification of 
HDAC8 protein, infected cells were lysed with buffer A (20 mM Tris-HCl, pH 7.5,
300mM NaCl, 0.1% NP-40 and 10mM 2-mercaptoethanol (2-ME)). After 
centrifugation at 3,000g the supernatants were incubated with Ni-NTA beads 
for 2h at 4°C. The beads were washed with buffer A, washed with buffer B 
(20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 10% glycerol, 0.05% NP-40 and 1 mM
dithiothreitol) and eluted with buffer B containing various concentrations of
imidazole (50, 100, 200 or 350mM). The elution fractions including HDAC8 


protein were collected and concentrated using a Centricon concentrator. For 
detection of SMC3 deacetylation by HDAC8, solubilized total lysates treated with 
benzonase or SMC1A immunoprecipitants were used as substrates. Substrates 
were incubated with recombinant HDAC8 or HDAC8 mutant in reaction buffer 
(50 mM HEPES-potassium hydroxide, pH 7.4, 10% glycerol, 10 mM KCl, 2.5 mM 
MgCl₂, 100 mM NaCl, 0.1 mM phenylmethylsulphonyl fluoride, 50 ng ml⁻¹ BSA
and 0.02% NP-40) for 1h at 30 °C and then boiled with SDS sample buffer. 
RNA interference. The siRNA oligonucleotides were annealed according to the
manufacturer’s instructions and used at a final concentration of 100-200 nM.
The siRNA transfections were performed using oligofectamine (Invitrogen). 
Oligonucleotides targeting the firefly luciferase GL2 were used as controls. All 
siRNA sequences are listed in Supplementary Table 2. 

Human subjects. All individuals enrolled in the study were diagnosed by clinical 
geneticists experienced in the diagnosis of CdLS to have clinical features consistent 
with a diagnosis of CdLS. All patients and family members were enrolled in the 
study under an Institutional Review Board-approved protocol of informed consent 
at The Children’s Hospital of Philadelphia or the Institut für Humangenetik
Lübeck.

Mutation screening. All genes discussed were screened for mutations in the 
coding exons and intron-exon boundaries using PCR of genomic DNA followed 
by sequencing. Primers were designed using ExonPrimer (http://ihg.gsf.de/ihg/ 
ExonPrimer.html). Primer sequences and PCR conditions are available on request. 
Sequencing was performed using BigDye Terminator v3.1 cycle sequencing and 
analysed on an ABI 3730 (Applied Biosystems). All probands were pre-screened 
and were negative for mutations in NIPBL, SMC1A and SMC3. 

Reference sequences and HDAC8 conservation analysis. National Center for
Biotechnology Information (NCBI) RefSeq accessions for messenger RNAs and 
proteins, respectively, referenced in this work include: HDAC8 (NM_018486 and 
NP_060956), NIPBL (NM_133433 and NP_597677) and SMC1A (NM_006306 
and NP_006297). HDAC8 protein sequences for Homo sapiens (AAF73428), Bos
taurus (DAA12953), Rattus norvegicus (AA162023), Mus musculus (CAM17598), 
Danio rerio (NP_998596), Xenopus laevis (NP_001085711), Xenopus tropicalis 
deduced from mRNA BC161282, Drosophila melanogaster (AAC61494) and for 
the Saccharomyces cerevisiae HDAC8-like proteins Hos2 (NP_011321) and Rpd3 
(AAT92832) were aligned by the ClustalW method” using MacVector software 
(Accelrys Corp). 

Mapping mutations to the HDAC8 crystal structures. The identified HDAC8
mutations were mapped onto the crystal structure data of human HDAC8- 
substrate complex (PDB accession 3F06 (ref. 23)) to visualize location with respect 
to active domains of the protein using Cn3D*, and PyMol”* software. 
Expression analysis of CdLS 32-gene classifier. Lymphocyte culture and RNA 
extraction were performed as previously described**. Genes and Nanostring”® 
probeset sequences are listed in Supplementary Table 3. For each sample, 
100 ng of RNA was analysed. Hybridizations were carried out by mixing 5 µl of
each RNA sample (normalized to 20 ng µl⁻¹) with 20 µl of NanoString nCounter
Reporter probes and 5 µl of nCounter Capture probes (30 µl total reaction volume)
and incubating the hybridizations at 65°C for 18h. After hybridization purifica- 
tion, NanoString reporter capture, stretching and imaging was performed as 
described**. To account for differences in assay efficiency the data were normalized
to the sum of six positive internal control RNA spikes that ranged from 0.125
to 128 fM. Internally normalized data were then normalized to the
geometric mean of the ACTB and RPL19 spike-normalized counts. To assess 
reproducibility, Pearson’s r correlation coefficient was calculated for three technical 
replicates (average r = 0.99986) and 11 biological replicates (average r = 0.99675). 
To define a ‘CdLS’ profile, average expression and standard deviations were 
calculated for 32 genes using RNA from LCLs from 10 normal controls and 12 
NIPBL truncating or nonsense mutations. For each patient gene expression assay, 
the number of deviations in expression from normal was determined and divided 
by the average NIPBL expression standard deviation. These products were then 
summed for each gene to give a score representing a sum of deviation from normal 
in a CdLS manner for all 32 genes tested for each patient. The data and calculation 
results for each step are in Supplementary Data File 1. 
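
The two-step count normalization described above (division by the summed positive-control spikes, then by the geometric mean of the spike-normalized ACTB and RPL19 counts) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name, argument names and example counts are hypothetical.

```python
import math


def normalize_counts(gene_counts, positive_spike_counts,
                     housekeeping=("ACTB", "RPL19")):
    """Two-step normalization sketch (hypothetical helper).

    Step 1: divide every raw count by the sum of the positive-control
            spike counts for that sample.
    Step 2: divide the result by the geometric mean of the
            spike-normalized housekeeping-gene counts.
    """
    spike_sum = sum(positive_spike_counts)
    if spike_sum <= 0:
        raise ValueError("positive-control counts must sum to a positive value")

    # Step 1: spike-in normalization.
    spike_norm = {gene: count / spike_sum for gene, count in gene_counts.items()}

    # Step 2: geometric mean of the housekeeping genes (counts must be > 0).
    hk_values = [spike_norm[gene] for gene in housekeeping]
    geo_mean = math.exp(sum(math.log(v) for v in hk_values) / len(hk_values))

    return {gene: value / geo_mean for gene, value in spike_norm.items()}


# Made-up counts for one sample; six positive-control spikes summing to 1,000.
example = normalize_counts(
    {"ACTB": 400, "RPL19": 100, "GENE_X": 200},
    [500, 250, 125, 75, 40, 10],
)
```

Because both steps are simple rescalings per sample, the ratios between genes within a sample are unchanged; only comparisons between samples are affected.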

Recombinant HDAC8 protein expression and purification from E. coli.
Identified human HDAC8 mutations were introduced into a previously described 
HDAC8-6His-pET20b construct” using a QuickChange site-directed mutagenesis 
kit (Agilent Genomics). Oligonucleotide sequences are available on request. 
HDAC8 was recombinantly expressed in BL21(DE3) E. coli cells and purified over
Talon resin (Clontech) according to published procedures”, with minor modifica- 
tions. In brief, 50 ml cultures (Luria-Bertani media supplemented with ampicillin 
(50 µg ml⁻¹)) were grown overnight and used to inoculate 1-l flasks (minimal
media supplemented with ampicillin at 50 µg ml⁻¹). Cells were grown at 37°C
until A600 nm of ~0.5, at which point the cells were induced by the addition of
isopropyl β-D-thiogalactopyranoside (0.4 mM final concentration) and ZnCl₂
(100 µM final concentration), and grown overnight at 18°C. The cells were
pelleted by centrifugation, resuspended in 25 ml of lysis buffer (50mM Tris, 
pH 8.0, 500 mM KCl, 5% glycerol, 3 mM 2-ME and 115 µM phenylmethanesulphonyl
fluoride), and lysed by sonication on ice. Cell debris was pelleted by centrifugation 
and the cell-free extract was purified by affinity chromatography (Talon resin, 
Clontech Labs) using a step gradient (buffer A: 50mM Tris, pH 8.0, 500mM 
KCl, 5% glycerol and 3mM 2-ME; buffer B: 50 mM Tris, pH 8.0, 500mM KCl, 
5% glycerol, 3 mM 2-ME and 250 mM imidazole). The final yield was approximately
8.6 mg l⁻¹ culture for wild-type HDAC8, 6.7 mg l⁻¹ for HDAC8(G320R),
6.1 mg l⁻¹ for HDAC8(H334R), 5.8 mg l⁻¹ for HDAC8(H180R) and 3.9 mg l⁻¹ for
HDAC8(T311M); protein concentrations were determined using the colorimetric 
Bradford assay’. The final purity of each protein sample was better than 90% as 
determined by SDS-PAGE analysis. 

HDAC8 activity was measured using the commercially available Fluor-de-Lys
HDAC8 deacetylase substrate and Developer II (Enzo Life Sciences). All assays
were run at 25°C and contained 0.5 µM enzyme and 150 µM substrate in assay
buffer (25 mM Tris, pH 8.2, 137 mM NaCl, 2.7 mM KCl and 1 mM MgCl₂). After
30 min, the reactions were quenched by addition of the HDAC8 inhibitor M344
(Sigma Aldrich, 100 µM final concentration) and Developer II. Fluorescence was
measured using a Fluoroskan II plate reader (excitation = 355 nm, emission = 460
nm). Product concentration was calculated from raw data using standard curves
for substrate and product (aminomethylcoumarin, Enzo Life Sciences), with the
total concentration of substrate and product kept at 150 µM (before dilution with
the developer solution). All activity assays were performed in triplicate. Enzyme
activities are reported as nmol product µmol enzyme⁻¹ min⁻¹.
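
As a worked illustration of the unit conversion, the sketch below turns a product concentration into a specific activity, assuming the reported unit is nmol product per µmol enzyme per minute and that product and enzyme share the same reaction volume. The function name and the 3 µM product value are hypothetical; only the 0.5 µM enzyme concentration and the 30 min reaction time come from the text.

```python
def specific_activity(product_uM, enzyme_uM, minutes):
    """Return activity in nmol product per µmol enzyme per minute.

    In a shared reaction volume, (µM product) / (µM enzyme) equals
    µmol product per µmol enzyme; multiplying by 1,000 converts the
    numerator to nmol.
    """
    if enzyme_uM <= 0 or minutes <= 0:
        raise ValueError("enzyme concentration and time must be positive")
    return product_uM / enzyme_uM * 1000.0 / minutes


# Hypothetical reading: 3 µM product formed by 0.5 µM enzyme in 30 min.
activity = specific_activity(product_uM=3.0, enzyme_uM=0.5, minutes=30.0)
```

With these illustrative numbers the result is 200 nmol product per µmol enzyme per minute.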

Assessment of X-chromosome inactivation. X-chromosome inactivation (XCI) 
was determined by evaluating the methylation status of the CAG microsatellite 
locus at the 5’ end of the androgen receptor gene as previously described with 
modifications**”’. In brief, genomic DNA was isolated from peripheral blood and 
two reaction digests were performed for each patient. In one reaction, 1 µg of DNA
was digested in 25 µl with the methylation-sensitive restriction enzyme HpaII
(New England Biolabs), which cuts the active unmethylated allele. In the other 
reaction, DNA was incubated in enzyme digest buffer without enzyme. After a 16h 
incubation at 37 °C, digestion was terminated by incubation at 65°C for 20 min. 
From each reaction, 2 µl was then amplified by PCR with primers flanking
the polymorphic androgen receptor CAG repeat as described**; forward, 
5'-CTGTGAAGGTTGCTGTTCCTCAT-3’, reverse, 5’-FAM-TCCAGAATCTG 
TTCCAGAGCGTGC-3’. ABI3730 Genetic Analyzer and GeneMapper V4.0 
software (Applied Biosystems) were used for genotyping analysis. The percentage 
of X-chromosome inactivation was calculated by dividing the ratio of the allele 
peak volumes in the HpaII-treated sample by the ratio of the allele peak volumes in
the untreated sample. We considered alleles separated by more than two CAG 
repeats as informative” and used a cutoff of >80:20% for skewed XCI and >95:5% 
for extremely skewed XCI. 
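
The ratio-of-ratios calculation and the skewing cutoffs can be sketched as below. This is an illustrative reading of the description, not the authors' implementation: the conversion of the corrected ratio into a percentage pair, the "random" label for unskewed samples, and all names and peak volumes are assumptions.

```python
def xci_percentages(digested_peaks, undigested_peaks):
    """Return (% allele 1, % allele 2) after correcting for amplification bias.

    Each argument is a pair of peak volumes (allele 1, allele 2).
    The digested ratio is divided by the undigested ratio, and the
    corrected ratio r is expressed as r / (1 + r) (an assumed convention).
    """
    d1, d2 = digested_peaks
    u1, u2 = undigested_peaks
    if d2 <= 0 or u1 <= 0 or u2 <= 0:
        raise ValueError("peak volumes used as denominators must be positive")
    corrected = (d1 / d2) / (u1 / u2)
    pct1 = 100.0 * corrected / (1.0 + corrected)
    return pct1, 100.0 - pct1


def classify_xci(pct_allele1):
    """Apply the >80:20 (skewed) and >95:5 (extremely skewed) cutoffs."""
    major = max(pct_allele1, 100.0 - pct_allele1)
    if major > 95.0:
        return "extremely skewed"
    if major > 80.0:
        return "skewed"
    return "random"  # assumed label for samples below the 80:20 cutoff


# Made-up peak volumes: equal amplification without digestion, 9:1 after HpaII.
pct1, pct2 = xci_percentages((900.0, 100.0), (500.0, 500.0))
label = classify_xci(pct1)
```

Dividing by the undigested ratio removes any preferential amplification of one allele, so the corrected value reflects methylation status alone.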

Fibroblast immortalization and lentiviral transduction. CDL016 fibroblasts 
were immortalized by exogenous expression of hTERT. HDAC8 and the
enzymatically dead HDAC8 mutant (D101A/Y306F)” were cloned into pENTR 
H1 Gateway vector and recombined with lentivirus vector, pCS-RfA-CG, using LR 
Clonase II (Invitrogen). Recombinant lentiviruses were produced from 293FT 
cells transfected with CS-CMV-HDAC8 or CS-CMV-HDAC8mut and packaging 
vectors. The fibroblasts were incubated with recombinant lentiviruses in the 
presence of 8 µg ml⁻¹ polybrene and transduction efficiency was measured using
green fluorescent protein (GFP) expression. 

ChIP and quantitative PCR. Cells were crosslinked with 1% formaldehyde for 
10 min, quenched with 125mM glycine, and prepared for ChIP as previously 
described’’. ChIP was performed as previously described using RAD21, SMC3-ac 
and control antibodies’. In brief, crosslinked cell lysates solubilized by sonication
were incubated with protein A or protein G Dynabeads (Dynal), crosslinked with 
the antibodies for 14h at 4°C. After this, beads were washed several times and 
eluted with elution buffer (50 mM Tris, 10 mM EDTA and 1% SDS) for 20 min at 
65 °C. The eluates were incubated at 65 °C overnight to reverse crosslinks and then 
treated with RNaseA and then with proteinase K. The samples were further purified 
by phenol-chloroform extraction and an extra purification step using a PCR puri- 
fication kit (Qiagen). Quantitative PCR was performed using KAPA SYBR Fast 
qPCR kit (Kapa Biosystems) on an ABI 7500 PCR system (Applied Biosystems). 
The results were presented as a percentage of input chromatin that was precipitated. 
Primers used in this study are listed in Supplementary Table 4.

ChIP-Seq analyses. DNA from whole cell extracts (WCE) and ChIP fractions 
was further sheared to an average size of approximately 150 base pairs (bp) by
ultrasonication (Covaris), end-repaired, ligated to sequencing adapters and 
amplified according to the manufacturer’s instructions (Applied Biosystems SOLiD
Library Preparation Protocol). Gel-purified amplified DNA between 100 and 
150 bp was sequenced on the Applied Biosystems SOLiD platforms (SOLiD 3 
and 5500) to generate single-end 50-bp reads. Sequenced reads of both ChIP 
fractions and WCE were aligned to the human genome (UCSC hg19) using 
Bowtie"’, allowing three mismatches in the first 28 bases per read (-n3 option). 
All duplicate reads and those without unique alignment were removed from 
further analysis. Sequencing, read and mapping data are summarized in 
Supplementary Table 5. We further analysed only uniquely aligned reads. Each 
aligned read was extended to a predicted fragment length of 150 bp. Reads were 
summed in 10-bp bins along the chromosome for ChIP and WCE. To facilitate 
comparison of detected peaks between different ChIP experiments, we normalized 
the read number of each bin per million uniquely mapped reads for that 
chromosome for both ChIP and WCE samples (that is, read number for each 
bin × 1,000,000/total number of reads mapped onto reference genomes). This
value was further smoothed with a 500 bp width (50 bins). Scanning the genome 
with a 300 bp (30 bin) sliding window, a one-sided Wilcoxon rank-sum test was 
performed to estimate the enrichment P value for each window. The fold enrich- 
ment (ChIP/WCE) for each window was also calculated. To call peaks, we used 
stringent criteria to identify windows that satisfied both fold enrichment >3.0 and 
P < 1 × 10⁻⁴ criteria for candidate binding sites”. To eliminate uncertain sites
further, we removed the regions with low ChIP reads using the criteria: (1) the 
average number of ChIP reads in region/the average number of ChIP reads in the 
genome <3.0; and (2) the maximum read intensity in ChIP bins was less than 0.2 
per million reads mapped. Peak calling summary statistics are summarized in 
Supplementary Table 6. 
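
The read extension, binning, per-million scaling and windowed fold-enrichment steps can be sketched in plain Python as follows. This is a simplified illustration, not the pipeline used in the study: it assumes forward-strand reads only, counts a fragment once in every bin it overlaps, and omits the smoothing step and the Wilcoxon rank-sum test. All function names are hypothetical.

```python
def binned_coverage(read_starts, chrom_length, fragment_len=150, bin_size=10):
    """Extend each read to fragment_len and count it in every bin it overlaps."""
    n_bins = (chrom_length + bin_size - 1) // bin_size
    bins = [0] * n_bins
    for start in read_starts:
        end = min(start + fragment_len, chrom_length)  # exclusive end
        for b in range(start // bin_size, (end - 1) // bin_size + 1):
            bins[b] += 1
    return bins


def per_million(bins, total_mapped_reads):
    """Scale bin counts to reads per million mapped reads."""
    scale = 1_000_000 / total_mapped_reads
    return [count * scale for count in bins]


def window_fold_enrichment(chip, wce, window_bins=30):
    """ChIP/WCE ratio of summed signal in each sliding window.

    Windows with no WCE signal yield None rather than a division error.
    """
    folds = []
    for i in range(len(chip) - window_bins + 1):
        chip_sum = sum(chip[i:i + window_bins])
        wce_sum = sum(wce[i:i + window_bins])
        folds.append(chip_sum / wce_sum if wce_sum > 0 else None)
    return folds


# Toy example: three reads on a 400-bp "chromosome".
coverage = binned_coverage([0, 5, 100], chrom_length=400)
```

In a fuller version, windows would be retained only if they pass both the fold-enrichment threshold and the rank-sum P-value threshold stated in the text.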

RNA-Seq analyses. We prepared two replicates for each sample for RNA- 
sequencing (Solexa HiSeq2000, Illumina). Sequenced RNA reads were aligned 
to RefSeq RNA database (NM accession numbers only) using Bowtie*! allowing 
three mismatches in the first 28 bases per read (-n3 option), and aligned transcript 
reads were merged for a single gene. To accommodate transcriptional variants, we 
used both uniquely and multiply aligned reads. Multiply aligned reads were 
divided equally among all locations (N-times matched reads were weighted as 
1/N reads). Reads that aligned to reference database more than 10 times were 
discarded. The expression level of each gene was calculated in reads per kilobase 
per million reads (RPKM)* and normalized by trimmed mean of M values 
(TMM)**. RPKM scores of the two replicates were averaged to calculate the 
enrichment ratio. Using a 2.5 RPKM threshold, we obtained a total of 9,763 
expressed genes (Supplementary Table 7). 
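
The 1/N weighting of multiply aligned reads and the RPKM calculation can be sketched as below. This is a minimal illustration with hypothetical names and toy data; it takes the total read count to be the sum of the weighted counts (an assumption) and leaves out the TMM normalization step.

```python
from collections import defaultdict


def weighted_gene_counts(read_alignments, max_hits=10):
    """Sum 1/N per alignment for reads that align N times.

    read_alignments: iterable of lists, one list per read, naming the gene
    hit by each alignment. Reads with more than max_hits alignments are
    discarded.
    """
    counts = defaultdict(float)
    for genes in read_alignments:
        n = len(genes)
        if n == 0 or n > max_hits:
            continue
        for gene in genes:
            counts[gene] += 1.0 / n
    return dict(counts)


def rpkm(counts, gene_lengths_bp):
    """Reads per kilobase of transcript per million mapped reads."""
    total = sum(counts.values())
    return {
        gene: count * 1e9 / (gene_lengths_bp[gene] * total)
        for gene, count in counts.items()
    }


# Toy data: one unique read, one read split between A and B,
# and one read with 11 alignments that is discarded.
counts = weighted_gene_counts([["A"], ["A", "B"], ["B"] * 11])
values = rpkm(counts, {"A": 1000, "B": 500})
```

Under this weighting, a read that matches two transcripts of the same gene still contributes one full read to that gene once transcripts are merged.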
Human ES-cell-derived cardiomyocytes electrically
couple and suppress arrhythmias in injured hearts 


Yuji Shiba, Sarah Fernandes, Wei-Zhong Zhu, Dominic Filice, Veronica Muskheli, Jonathan Kim, Nathan J. Palpant,
Jay Gantz, Kara White Moyes, Hans Reinecke, Benjamin Van Biber, Todd Dardas, John L. Mignone, Atsushi Izawa,
Ramy Hanna, Mohan Viswanathan, Joseph D. Gold, Michael I. Kotlikoff, Narine Sarvazyan, Matthew W. Kay,
Charles E. Murry & Michael A. Laflamme


Transplantation studies in mice and rats have shown that human 
embryonic-stem-cell-derived cardiomyocytes (hESC-CMs) can 
improve the function of infarcted hearts'*, but two critical issues 
related to their electrophysiological behaviour in vivo remain 
unresolved. First, the risk of arrhythmias following hESC-CM 
transplantation in injured hearts has not been determined. 
Second, the electromechanical integration of hESC-CMs in injured 
hearts has not been demonstrated, so it is unclear whether these 
cells improve contractile function directly through addition of new 
force-generating units. Here we use a guinea-pig model to show 
that hESC-CM grafts in injured hearts protect against arrhythmias 
and can contract synchronously with host muscle. Injured hearts 
with hESC-CM grafts show improved mechanical function and a 
significantly reduced incidence of both spontaneous and induced 
ventricular tachycardia. To assess the activity of hESC-CM grafts 
in vivo, we transplanted hESC-CMs expressing the genetically 
encoded calcium sensor, GCaMP3 (refs 4, 5). By correlating 
the GCaMP3 fluorescent signal with the host ECG, we found that 
grafts in uninjured hearts have consistent 1:1 host-graft coupling. 
Grafts in injured hearts are more heterogeneous and typically 
include both coupled and uncoupled regions. Thus, human myo- 
cardial grafts meet physiological criteria for true heart regenera- 
tion, providing support for the continued development of 
hESC-based cardiac therapies for both mechanical and electrical 
repair. 

Although hESC-CMs form gap junctions and beat synchronously in 
vitro®’, there is only indirect evidence for their electromechanical 
integration after transplantation’ *. We do not know whether hESC- 
CM grafts contract synchronously at physiological human rates, integ- 
rate in injured hearts despite scar tissue, or affect electrical stability. 
Indeed, both pro-arrhythmic? and anti-arrhythmic’® effects have been 
reported for mouse cardiomyocyte grafts in injured mouse hearts. 
Human cardiac grafts could plausibly contribute to arrhythmogenesis 
through automaticity’ and triggered activity’’, and their irregular 
graft geometry could promote reentry'*"®. 

To address these uncertainties, we employed a new guinea-pig 
model of cardiac injury. Prior work with hESC-CMs in infarcted hearts 
was carried out in mice and rats’, but these species’ rapid heart rates 
(approximately 600 and 400 beats per minute (b.p.m.), respectively'”) 
may prevent host-graft coupling or arrhythmias that could occur in 
humans. In vitro hESC-CMs show a spontaneous rate of approxi- 
mately 50 to 150 b.p.m.’*”*"* and can be paced up to 240 b.p.m., 
suggesting that they can keep up with the guinea-pig heart (approxi- 
mately 200 to 250 b.p.m."). 


We first examined the structural, mechanical and electrocardio- 
graphic consequences of hESC-CM transplantation in injured hearts 
of immunosuppressed guinea-pigs (Supplementary Fig. 1a). hESC-CMs
were derived from H7 hESCs, as previously described”. Adult
guinea-pigs were subjected to cardiac cryoinjury and implanted with 
telemetric electrocardiographic (ECG) transmitters. Ten days later, 
they underwent a repeat thoracotomy and intra-cardiac injection of 
either 1 × 10⁸ hESC-CMs in a pro-survival cocktail (PSC) of factors
previously shown to enhance hESC-CM engraftment” (n = 15),
1 × 10⁸ non-cardiac hESC-derivatives in PSC (non-CMs; n = 13),
or PSC vehicle alone (n = 14). hESC-CMs were 63% pure by anti-α-actinin
flow cytometry, whereas non-CMs included no detectable
cardiomyocytes (Supplementary Fig. 2). 

Twenty-eight days after transplantation, all animals showed trans- 
mural scar and thinning of the left ventricle. Scar area was not different 
among the groups (13.2 ± 0.9% of the left ventricle in hESC-CM,
14.8 ± 1.4% in non-CM, and 15.3 ± 1.9% in PSC-only recipients).
However, hESC-CM recipients showed partial remuscularization with
islands of human myocardium occupying 8.4 ± 1.5% of the scar area
(Fig. 1a). The human origin of these grafts was confirmed by in situ 
hybridization with a human pan-centromeric probe, and more than 
99% of the human cells immunostained positively with the cardiac 
marker β-myosin heavy chain (βMHC; also known as MYH7)
(Fig. 1b and Supplementary Fig. 3a–f). No teratomas developed, and
hESC-CM grafts were negative for multiple non-cardiac markers
(Supplementary Fig. 3g–k). Most of the graft myocardium was located
in the central scar, but there were occasional points of host-graft 
contact in the border zone with shared intercalated discs identified 
by anti-connexin 43 and cadherin immunostaining (Fig. 1c-h and 
Supplementary Fig. 3d, e). Minimal immune reaction was observed 
in sections stained with a guinea-pig-specific pan-leukocyte marker 
(Supplementary Fig. 3l). Grafts were supplied by host-derived
neovessels that contained erythrocytes, indicating perfusion by the 
host coronary circulation (Supplementary Fig. 3h, m-o). 

Surviving human cells were found in only 7 of 13 non-CM recipients 
at 28 days after transplantation, and these grafts were smaller than 
those in hESC-CM recipients (less than 1% of the scar area). No 
βMHC-positive graft was detected in non-CM recipients; instead,
the surviving human cells consisted of small epithelial nests and 
scattered fibroblastic cells (Supplementary Fig. 4). 

Hearts receiving hESC-CMs, non-CMs and PSC only were assessed
by echocardiography on days −2 and +28 relative to cell transplantation.
All groups showed increased left-ventricle dimensions and
reduced fractional shortening on day −2 relative to uninjured controls
of host-graft contact, dual-labelled for HumCent
(white) and βMHC (red) (c; a higher magnification
image of the area in the white box in c is shown in
d), de-stained and then immunostained for βMHC
(red) and either connexin 43 (Cx43, green; e and
g) or cadherins (green; f and h). Arrows indicate
Cx43 and cadherin shared between graft and host
myocytes. i, Left-ventricle fractional shortening (FS)
by echocardiography in uninjured animals and
cryoinjured recipients of PSC vehicle only, non-CMs
or hESC-CMs at 2 days prior to and 28 days after
transplantation (Tx). j, Representative telemetric


(n = 11), but there were no differences among the cryoinjured groups 
(Fig. 1i and Supplementary Fig. 5). The two control groups exhibited
further left ventricle dilation and deterioration of fractional shortening
between days −2 and +28 (P < 0.05 for all pairwise comparisons
between time points). This deterioration was completely attenuated in 
cryoinjured hESC-CM recipients, which showed significantly greater 
fractional shortening on day +28 than cryoinjured PSC-only or non- 
CM recipients (P < 0.05 and P < 0.01, respectively) (Fig. 1i). 

Ambulatory telemetric ECG recordings were obtained regularly 
from 1 day after injury to 28 days after transplantation. Pilot studies 
revealed only rare premature ventricular contractions (PVCs) in 
uninjured animals, whereas cryoinjured but not transplanted animals 
showed occasional PVCs and runs of ventricular tachycardia 
(Supplementary Table 1). Interestingly, the hESC-CM group had the 
lowest fraction of animals with PVCs after transplantation and the 
lowest rate of couplet PVCs per hour (Supplementary Fig. 6). 
Moreover, fewer hESC-CM recipients showed spontaneous ventricular 
tachycardia, and hESC-CM recipients were the only group with no 
sustained ventricular tachycardia (Fig. 1j, k). Next, we counted all 
episodes of ventricular tachycardia from day 3 to 28 and found that 
non-CM recipients showed 785% more ventricular tachycardia 
episodes than hESC-CM recipients (95% confidence interval: 74% to 
+4,370%, P < 0.01; Fig. 1l).

As a final test of electrical stability, we induced arrhythmias through
programmed electrical stimulation (PES). To detect either a positive or
negative effect of cell transplantation, we developed a PES protocol that 
induced ventricular tachycardia in 40% of non-transplanted animals at 
38 days after cryoinjury (n = 5), but not in uninjured animals (n = 8). 
When cryoinjured hESC-CM recipients were challenged with PES at 
28 days after transplantation, ventricular tachycardia was induced in 
only 6.7% of these animals versus 61.5% and 50% of cryoinjured non- 
CM and PSC-only recipients, respectively (P < 0.01 by Fisher’s exact 
test, Fig. 1m, n). 
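
As an illustration of the statistical comparison, the sketch below runs a two-sided Fisher's exact test on a 2 × 2 table of induced versus non-induced animals. The counts are inferred from the reported percentages and group sizes (6.7% of 15 = 1 animal; 61.5% of 13 = 8 animals) and are therefore an assumption, as is the pairwise hESC-CM versus non-CM comparison; the text does not state how the groups were compared. The function name is hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables (with the same
    margins) that are no more likely than the observed table. Integer
    weights are compared to avoid floating-point ties.
    """
    row1, row2, col1 = a + b, c + d, a + c

    def weight(k):
        # Number of ways to obtain k events in row 1 given fixed margins.
        return comb(row1, k) * comb(row2, col1 - k)

    k_min, k_max = max(0, col1 - row2), min(row1, col1)
    observed = weight(a)
    extreme = sum(
        weight(k) for k in range(k_min, k_max + 1) if weight(k) <= observed
    )
    return extreme / comb(row1 + row2, col1)


# Inferred counts: hESC-CM 1 of 15 induced; non-CM 8 of 13 induced.
p_value = fisher_exact_two_sided(1, 14, 8, 5)
```

With these inferred counts the pairwise comparison gives a P value below 0.01, in line with the significance level reported in the text.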

To explore the mechanism by which hESC-CMs improved mech- 
anical and electrical function, we investigated whether the grafts were 
from a cryoinjured non-CM recipient induced to
sustained VT by PES. n, Percentage of animals in
each group induced to VT. All data are presented as
mean ± s.e.m.; n = 13 per group. NS, not significant.
*P < 0.05; **P < 0.01. †P < 0.05 versus day −2.


coupled and beat synchronously with host myocardium. We used 
zinc-finger nuclease (ZFN)-mediated transgenesis”’ to create hESC- 
CMs that stably expressed the genetically encoded fluorescent calcium 
sensor, GCaMP3 (refs 4, 5), and these myocytes exhibited robust 
fluorescence transients with each contractile cycle in vitro (Sup- 
plementary Fig. 7 and Supplementary Movie 1). We transplanted 
1 × 10⁸ GCaMP3-positive hESC-CMs (cardiac purity of 74 ± 10%) 
into intact and cryoinjured guinea-pig hearts and carried out intravital 
imaging at 2 or 4 weeks after transplantation (Supplementary Fig. 1b). 
Epicardial fluorescent transients indicated activation of the graft in vivo, 
and these were correlated with the ECG to determine their synchrony 
with host myocardium. 

The GCaMP3-positive hESC-CM grafts in uninjured hearts had 
extensive host-graft contact with little intervening fibrosis (Fig. 2a 
and Supplementary Fig. 8a). Correspondingly, these hearts showed 
an impressive degree of host-graft electromechanical integration. 
We found large GCaMP3-positive hESC-CM grafts in all uninjured 
recipients at 14 days after transplantation (n = 9), and 100% of the 
visible graft in each animal showed calcium fluorescence transients 
that synchronized 1:1 with systole in the host ECG (Fig. 2b). These 
hearts were also imaged ex vivo during mechanical arrest with either 
2,3-butanedione monoxime (BDM) or blebbistatin (n = 11). 1:1 host- 
graft coupling continued under these conditions, ruling out confound- 
ing motion artefacts and the possibility of indirect graft activation 
by passive stretch (Supplementary Fig. 8b, c and Supplementary 
Movie 2). hESC-CM grafts in uninjured hearts remained fully coupled 
during pacing at rates =5 Hz (Fig. 2c). 

Next, we transplanted GCaMP3-positive hESC-CMs at 10 days after 
cryoinjury and carried out intravital imaging at 14 and 28 days after 
transplantation (n = 7 and n = 15 animals, respectively). By histology, 
surviving grafts at both time points were mostly located within the 
central scar, although occasional points of host-graft contact in the 
border zone were identified (Fig. 3a and Supplementary Fig. 9a). 
Dual labelling with a guinea-pig-specific in situ probe and anti-green 
fluorescent protein (GFP) immunostaining ruled out a meaningful 
contribution of cell fusion to the observed GCaMP3 signals 
(Supplementary Fig. 9b and Supplementary Movie 3). 

Figure 2 | hESC-CM grafts in uninjured hearts show 1:1 coupling with host 
myocardium. a, GCaMP3-positive hESC-CM graft in an uninjured heart 
immunostained for GFP (green) and βMHC (red), showing extensive host- 
graft contact with minimal intervening scar. b, Left, GCaMP3-positive hESC- 
CM graft in an uninjured heart during systole and diastole, acquired using an 
open-chest preparation at 14 days after transplantation. Right, GCaMP3 
fluorescence intensity (FI) versus time for the red, green and blue regions of 
interest indicated in the left panel, as well as the host ECG (black). GCaMP3 
fluorescent transients in all three regions occurred with 1:1 coupling with QRS 
complexes of the host ECG. a.u., arbitrary units. c, GCaMP3 fluorescent signal 
(red) and ECG (black) from a representative hESC-CM graft in an uninjured 
heart, imaged ex vivo under mechanical arrest. 1:1 host-graft coupling occurred 
during spontaneous beating and pacing at rates =5 Hz. Arrow, onset of pacing. 

Cryoinjured hearts with GCaMP3-positive hESC-CM grafts 
showed robust epicardial fluorescence transients, but their behaviour 
was more heterogeneous than that of grafts in uninjured animals, with 
greater variation in size, distribution and temporal relationship to the 
host ECG. Given this complicated behaviour, we focused on imaging 
mechanically arrested hearts ex vivo, so that the grafts could be more 
completely visualized and the proportion of coupled and uncoupled 
graft could be quantified better (Supplementary Movies 4 and 5). We 
found regions of GCaMP3-positive graft with consistent 1:1 host-graft 
coupling in approximately 60% of hearts at both 14 and 28 days after 
transplantation (4 out of 7 animals, and 9 out of 15 animals, respectively; 
Fig. 3b and Supplementary Tables 2 and 3). Although there was no 
significant difference in the fraction of animals with coupled grafts 
between the two time points, in the subset of animals with at least some 
coupling, there was a trend towards greater host-graft coupling over 
time (66 ± 10% versus 87 ± 6% of visible graft at 14 and 28 days, 
respectively). No grafts with intermittent or partial coupling (for 
example, 1:2 or 1:3 host-graft coupling) were observed. Uncoupled graft 
regions showed regular GCaMP3 fluorescent transients, but these 
occurred with a periodicity unrelated to the ECG or neighbouring graft 
domains, usually at a rate slower than the host (at 28 days, 96 ± 11 b.p.m. 
versus a host rate of 176 ± 9 b.p.m.). By marking the coupled and 
non-coupled graft regions with indelible inks that survive histological 
processing, we found many examples of reliably coupled graft located 
within the central scar region, proving that coupling is not limited to 
grafts at the border zone (data not shown). When injured hearts with 
GCaMP3-positive hESC-CM grafts were paced, most of the initially 
coupled graft regions remained coupled up to 4 Hz, after which some 
loss of coupling was observed (Supplementary Fig. 10). 

Figure 3 | hESC-CM grafts show 1:1 host-graft coupling in the majority of 
injured recipient hearts, but the extent of coupling and pattern of activation 
is variable. a, GCaMP3-positive hESC-CM graft in a cryoinjured heart 
immunostained for GFP (green) and βMHC (red). Small graft nests were 
located in host muscle within the border zone, but most of the graft was in scar. 
b, Percentage of visible GCaMP3-positive hESC-CM graft in each cryoinjured 
heart that showed 1:1 host-graft coupling at 14 and 28 days after 
transplantation (n = 7 and n = 15 animals, respectively). c, Representative 
28-day-old GCaMP3-positive hESC-CM graft in a cryoinjured heart imaged ex 
vivo during mechanical arrest with blebbistatin and pacing at 3 Hz. Top, traces 
of mean fluorescent intensity versus time for graft regions located within host 
muscle (1, blue font) and the cryoinjury zone (2, red font). Both were activated 
in a 1:1 correlation with the host ECG (black). Bottom, corresponding 
activation map showing the interval (in ms) between the stimulus pulse and the 
local rise in GCaMP3 fluorescence. Graft in host muscle (1) showed uniformly 
rapid activation, whereas the graft in the scar (2) activated first in the centre of the 
scar and then gradually progressed towards the border zone. On other 
occasions, graft activation started at the border zone and radiated into the scar. 
d, Top, traces of GCaMP3 fluorescence and ECG. Bottom, the activation map 
for a representative GCaMP3-positive hESC-CM graft in a 
blebbistatin-arrested uninjured heart. This graft showed 1:1 host-graft coupling and a brief 
interval between stimulus and GCaMP3 transient upstroke. 

We next attempted to perform voltage mapping of graft and host 
myocardium in cryoinjured hearts but, unexpectedly, hESC-CM grafts 
failed to label with the potentiometric dyes RH237 and di-4-ANEPPS 
(Supplementary Fig. 11). hESC-CMs were readily labelled in vitro 
(Supplementary Fig. 12), so their poor staining in vivo probably reflects 
sluggish perfusion of the graft tissue and perhaps reduced membrane 
staining because of undeveloped T-tubules. However, we did obtain 
good labelling of host myocardium with RH237 and we therefore com- 
pared voltage maps from cryoinjured hearts with and without grafts 
(n = 6 and n = 4 animals, respectively). As expected, both groups 
showed robust propagation in uninjured myocardium, whereas optical 
action potential amplitudes and epicardial conduction velocities were 
reduced in the cryoinjury zone. The two groups showed no significant 
differences in the amplitudes of optical action potentials within the 
injured zone or the kinetics of activation and repolarization 
(Supplementary Figs 13 and 14), suggesting that hESC-CM transplantation 
does not exert major paracrine effects on host electrical 
behaviour. 

Because the electrical activity of hESC-CM grafts could not be 
evaluated with potentiometric dyes, we used instead the GCaMP3 
signal to assess graft activation and propagation. GCaMP3 transients 
lag depolarization owing to slow binding of calcium, but this delay 
should be the same throughout the graft, so graft activation maps 
can be generated using the time interval between electrical stimulation 
(or host QRS complex of the ECG) and the GCaMP3 transient. This 
revealed a marked difference in activation between grafts located in 
viable host muscle versus those in scar tissue (Fig. 3c, d). Grafts in 
intact myocardium were activated uniformly and within approxi- 
mately 10 ms, probably reflecting synchronous activation through 
multiple points of contact with the host. In contrast, graft activation 
within scar tissue was spatially and temporally heterogeneous, even in 
regions with 1:1 coupling. These coupled grafts showed initial activa- 
tion within approximately 10 to 20 ms, but the spread of activation was 
slower, such that some areas required >50 ms for activation. This 
probably reflects the more limited contacts between graft and host, 
as well as slower propagation within the immature graft myocardium. 

These intravital imaging studies are, to our knowledge, the first 
direct demonstration that human cardiomyocytes can integrate and 
contract synchronously with host myocardium. Although additional 
paracrine mechanisms cannot be excluded, the demonstration of elec- 
tromechanically coupled grafts in injured hearts supports the idea that 
hESC-CMs can improve mechanical function by creating new force- 
generating units, a sine qua non for heart regeneration. However, 
hESC-CM grafts in injured hearts were not always fully integrated, 
implying that enhancing their integration may further benefit con- 
tractile function. The model developed here represents a reasonably 
high-throughput platform to test strategies to improve host-graft 
integration (for example, tissue engineering, attenuating graft cell 
death and reducing fibrosis). 

Our study also provides reassurance about the arrhythmogenic risk 
of cardiac repair with immature stem-cell-derived cardiomyocytes. 
Consistent with prior transplantation studies with primary fetal mouse 
cardiomyocytes, we observed an arrhythmia-suppressive effect that 
was unique to cardiomyocyte grafts and occurred despite incomplete 
host-graft coupling. Although this is encouraging, before clinical 
translation of hESC-based therapies, it will be necessary to carry out 
safety and efficacy studies with longer follow-up durations and in a 
large animal model. 


METHODS SUMMARY 


GCaMP3-positive hESCs were generated by ZFN-mediated targeting, following 
a method described previously (Supplementary Fig. 7), and these transgenic and 
wild-type hESCs were differentiated into cardiomyocytes, as previously reported. 
Guinea-pigs underwent cryoinjury, immunosuppression with cyclosporine plus 
methylprednisolone, cell transplantation and physiological monitoring as detailed 
in the Supplementary Methods and Supplementary Fig. 1. Intravital imaging of 
GCaMP3-positive hESC-CM grafts was carried out on day 14 or 28 after transplanta- 
tion, using open chest and ex vivo Langendorff preparations. GCaMP3 and RH237 
fluorescent signals were acquired with a high-speed camera (Andor iXon 860 EM- 
CCD) on an epifluorescence stereomicroscope. All procedures complied with regula- 
tions of the University of Washington Institutional Animal Care and Use Committee. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Cell preparation. Undifferentiated H7 hESCs were expanded using either mouse 
embryonic fibroblast-conditioned medium (MEF-CM) or a defined medium 
supplemented with basic fibroblast growth factor and transforming growth factor 
β1 (R&D Systems). hESCs were then differentiated into cardiomyocytes using 
our previously reported directed differentiation protocol that involves the serial 
application of activin A and bone morphogenetic protein 4 (BMP4, R&D) under 
defined, serum-free, monolayer culture conditions. Non-cardiac control 
cells were generated by subjecting hESCs to the same protocol but without 
the addition of activin A and BMP4. See Supplementary Fig. 2 for additional 
immunophenotypic information regarding the two cell preparations employed. 

Human ESC-CMs and non-cardiac hESC derivatives were collected and 
cryopreserved after 16 to 18 days under differentiating conditions. One day before 
collection, cells were subjected to a pro-survival protocol, previously shown to 
enhance engraftment after transplantation. In brief, cultures were heat-shocked 
with a 30-min exposure to 43 °C medium, followed by RPMI-B27 medium 
supplemented with IGF1 (100 ng ml⁻¹, Peprotech) and cyclosporine A (0.2 μM, 
Sandimmune, Novartis). One day later, cultures were collected with 0.25% 
trypsin/0.5 mM EDTA (Invitrogen) and cryopreserved as described previously. 

Immediately before transplantation, cells were thawed at 37 °C, washed with 
RPMI, and suspended in a 150-μl volume (per animal) of pro-survival cocktail 
(PSC), which consisted of 50% (v/v) growth factor-reduced Matrigel, supplemented 
with ZVAD (100 mM, benzyloxycarbonyl-Val-Ala-Asp(O-methyl)-fluoromethyl 
ketone, Calbiochem), Bcl-XL BH4 (cell-permeant TAT peptide, 50 nM, 
Calbiochem), cyclosporine A (200 nM, Wako), IGF1 (100 ng ml⁻¹, Peprotech) 
and pinacidil (50 mM, Sigma). 
Surgical procedures. For cardiac cryoinjury and ECG transmitter placement, 
male guinea-pigs (650-750 g, Charles River) were anaesthetized with an 
intraperitoneal injection of 50 mg kg⁻¹ ketamine and 2 mg kg⁻¹ xylazine. The skin was 
shaved and sterilized, a small incision was made in the right flank and the trans- 
mitter (PhysioTel model CA-F40, DSI) was inserted in a subcutaneous pocket. The 
positive and negative leads were then tunnelled subcutaneously to the left fifth 
intercostal space and the upper sternal midline, respectively. The animal was then 
intubated, mechanically ventilated and anaesthetized with 1.5% isoflurane. The 
heart was exposed by a left thoracotomy, the pericardium was opened and a 
10-mm-diameter aluminium cryoprobe, pre-cooled with liquid nitrogen, was 
applied to the left ventricular free wall 4 times for 30 s each. At the end of the 
surgery, all animals received 0.025% (v/v) topical bupivacaine at the wound site, as 
well as 0.05 mg kg⁻¹ intraperitoneal buprenorphine for analgesia. 

From day −2 to day +28 relative to cell transplantation, animals were treated 
with an immunosuppressive regimen of methylprednisolone (2 mg kg⁻¹ per day, 
intraperitoneal) and cyclosporine A (15 mg kg⁻¹ per day, subcutaneous, for 7 days; 
thereafter reduced to 7.5 mg kg⁻¹ per day). The trough level of blood cyclosporine 
was measured on day +28 at 838 ± 64 μg l⁻¹. Cell transplantation was carried out 
using a repeat thoracotomy and injecting the total bolus of cells into three separate 
injection sites, that is, the central cryolesion and the flanking lateral border zones. 
Echocardiography. On days −2 and +28 relative to cell transplantation, animals 
were lightly anaesthetized with inhaled 1.5% isoflurane, and then their left- 
ventricular end-diastolic dimension (LVEDD), left-ventricular end-systolic 
dimension (LVESD) and heart rate were measured by transthoracic echocardio- 
graphy (GE Vivid 7) with a 10-MHz paediatric transducer. Fractional shortening 
(FS) was calculated using this equation: FS = 100 × ((LVEDD − LVESD) / 
LVEDD). All echocardiographic scans and analyses were performed by an 
operator who was blind to the experimental conditions. 
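
The fractional-shortening equation above can be illustrated with a minimal Python sketch. The function name and the example dimensions are hypothetical and are not taken from the study; only the formula itself comes from the text.

```python
def fractional_shortening(lvedd_mm: float, lvesd_mm: float) -> float:
    """Return fractional shortening (%) as 100 * (LVEDD - LVESD) / LVEDD."""
    if lvedd_mm <= 0:
        raise ValueError("LVEDD must be positive")
    if not 0 <= lvesd_mm <= lvedd_mm:
        raise ValueError("LVESD must lie between 0 and LVEDD")
    return 100.0 * (lvedd_mm - lvesd_mm) / lvedd_mm


# Hypothetical dimensions in mm (illustrative only, not study data).
print(fractional_shortening(8.0, 5.0))  # 37.5
```

Because FS is a ratio of two lengths, the result is independent of the unit used, provided both dimensions share it.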

Telemetric electrocardiography. ECG recordings were acquired from conscious, 
freely mobile animals using a Dataquest ART telemetry system (DSI). All 
recordings were obtained at the same time of day (evening) for 6 h (3 h in the 
light, 3 h in the dark) during a period in which the animals’ living quarters were 
undisturbed. Recordings were obtained from cryoinjured recipients of hESC-CMs, 
non-CMs or PSC only, on days 1, 6 and 9 after injury and days 1, 3, 7, 10, 14, 17, 21, 
24 and 27 after transplantation. All ECG traces were evaluated manually by a 
cardiologist who was blind to the experimental conditions. The cardiologist deter- 
mined the total number and frequency of events including single and multiform 
PVCs, as well as non-sustained and sustained ventricular tachycardia. In accordance 
with Lambeth convention guidelines, ventricular tachycardia was defined as 
a run of 4 or more PVCs, and sustained ventricular tachycardia was defined as a 
fast ventricular rhythm of more than 15 beats. 
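
The run-length definitions above can be expressed as a short Python sketch. In the study the traces were scored manually by a cardiologist; the beat-label string (`N` for a normal beat, `V` for a ventricular ectopic beat), the function names and the label "short PVC run" for runs of 2 or 3 beats are assumptions made here for illustration. Only the two thresholds (4 or more beats for ventricular tachycardia, more than 15 beats for sustained ventricular tachycardia) come from the text.

```python
from itertools import groupby
from typing import List

VT_MIN_BEATS = 4          # run of 4 or more PVCs counts as VT (from the text)
SUSTAINED_ABOVE = 15      # more than 15 beats counts as sustained VT (from the text)


def classify_run(n_beats: int) -> str:
    """Label one uninterrupted run of ventricular ectopic beats."""
    if n_beats > SUSTAINED_ABOVE:
        return "sustained VT"
    if n_beats >= VT_MIN_BEATS:
        return "non-sustained VT"
    if n_beats >= 2:
        return "short PVC run"   # hypothetical label for couplets and triplets
    return "single PVC"


def classify_ectopy(beats: str) -> List[str]:
    """Classify every run of 'V' beats in a string of beat labels."""
    return [
        classify_run(len(list(group)))
        for label, group in groupby(beats)
        if label == "V"
    ]


# Hypothetical beat sequence containing runs of 1, 2, 4 and 16 ectopic beats.
example = "NNVNVVNVVVVN" + "V" * 16 + "N"
print(classify_ectopy(example))
```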

Programmed electrical stimulation. PES studies were performed on day +28 
after transplantation, using methods modified from a previous study. In brief, 
each animal was mechanically ventilated, anaesthetized with 2% isoflurane, and 
outfitted for standard surface ECG recordings (ADInstruments). A midline 
incision was made in the epigastric region, and a custom-designed bipolar stimu- 
lating electrode (FHC) was inserted through the diaphragm into direct contact 


with the cardiac apex. PES studies were then carried out using a stimulator 
generator (STG-1000, MultiChannel Systems) with the pulse output set at twice 
the capture threshold and the pulse width at 1 ms. We used standard clinical PES 
protocols, including the application of single, double and triple extra stimuli after a 
train of 8 conditioning stimuli at a 150-ms cycle length. To determine the vent- 
ricular effective refractory period (VERP), a first extra stimulus (S2) was applied 
with the S1-S2 interval decreased in 5-ms increments from 150 ms until the 
occurrence of loss of capture. The heart was then challenged three times with 
the train of eight followed by the single extra stimulus (with the S1-S2 interval 
set at VERP + 10 ms). If no ventricular tachycardia was induced, this procedure 
was repeated to apply three challenges with double and, if necessary, triple extra 
stimuli. In each case, the coupling interval between the final two pulses was set at 
the VERP for the last extra stimulus + 10 ms. Ventricular tachycardia was only 
induced in cryoinjured animals with the application of a triple extra stimulus. 
Notably, ventricular-tachycardia induction by PES was strongly associated with a 
history of spontaneous ventricular tachycardia during the final day of telemetry 
monitoring (P < 0.001, McNemar’s test). 
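
The VERP search described above (S1-S2 interval shortened in 5-ms steps from 150 ms until capture is lost) can be sketched in Python as follows. This is an illustration under stated assumptions: `captures` is a hypothetical callback standing in for the observed response of the heart, VERP is taken as the longest S1-S2 interval that fails to capture (a common convention that the text does not spell out), and the toy refractory value is invented.

```python
from typing import Callable, Optional


def find_verp(
    captures: Callable[[int], bool],
    start_ms: int = 150,
    step_ms: int = 5,
) -> Optional[int]:
    """Shorten the S1-S2 interval stepwise and return the first interval
    (in ms) at which the extra stimulus no longer captures, or None."""
    interval = start_ms
    while interval > 0:
        if not captures(interval):
            return interval
        interval -= step_ms
    return None


def challenge_interval(verp_ms: int, margin_ms: int = 10) -> int:
    """Coupling interval used for the arrhythmia challenge: VERP + 10 ms."""
    return verp_ms + margin_ms


# Toy tissue model (hypothetical): stimuli capture only above 112 ms.
verp = find_verp(lambda s1s2_ms: s1s2_ms > 112)
print(verp, challenge_interval(verp))  # 110 120
```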

Generation of the GCaMP reporter hESC line. A transgene encoding for the 
constitutive expression of GCaMP3 was inserted into the AAVS1 locus in H7 
hESCs, using methods adapted from a previous study (see Supplementary Fig. 
7). In brief, the right and left arms of an AAVS1-specific ZFN were de novo 
synthesized (Genscript) and cloned into a single polycistronic plasmid in which 
the expression of each was driven by an independent human PGK promoter. A 
second polycistronic vector was generated in which (approximately 800-bp) 
homology arms flanking the AAVS1 ZFN cut site (pZDonor, Sigma Aldrich) 
surrounded a 5.1-kb insert with two elements: a cassette in which the CAG pro- 
moter drives expression of GCaMP3 (ref. 5) (Addgene, plasmid #22692) and a 
second cassette encoding for PGK-driven expression of neomycin resistance. 

AAVS1 ZFN (5 μg) and AAVS1 CAG GCaMP3 targeting vector (40 μg) 
plasmids were co-electroporated (Lonza, Nucleofection system) into H7 hESCs 
cultured in MEF-CM supplemented with 10 μM Y-27632. Green fluorescent 
colonies were isolated and expanded and selected with 40 to 100 μg ml⁻¹ G418 
(Invitrogen) for 5 to 10 days. GCaMP3 H7 hESCs showed a normal karyotype. 
Southern blot analysis. Wild-type and transgenic GCaMP3-positive hESC genomic 
DNA were digested with the restriction enzyme SphI, run on 1% polyacrylamide 
gel and transferred to a membrane (BioRad Zeta Probe). The membrane was 
washed in 2× SSC and dried at 80 °C in a hybridization oven for 2 h, followed by 1 h 
of pre-hybridization in 50% formamide, 0.12 M NaH₂PO₄, 0.25 M NaCl, 7% 
SDS, and 1 mM EDTA at 43 °C. A hybridization probe was generated with the 
following primers: CCTGTTAGGCAGATTCCTTATC (sense), 
AGATGGTGGACGAGGAAGGGG (antisense). The probe was labelled with ³²P-dCTP 
(Amersham Megaprime DNA labelling system) and hybridized overnight in 
hybridization buffer at 43 °C. After 24 h, the membrane was washed for 20 min 
with 2× SSC/0.1% SDS followed by 20 min in 0.1× SSC/0.1% SDS. The membrane 
was then exposed to autoradiographic film for 3 days. The wild-type band is 
expected at 6.5 kb, whereas the targeted locus shifts to 2.9 kb. 
Imaging of GCaMP3-expressing grafts. Intravital imaging of hearts with 
GCaMP3-positive grafts was performed on days 14 and 28 after transplantation 
using either an open-chest or ex vivo preparation. For the open-chest preparation, 
guinea-pigs were anaesthetized, mechanically ventilated and outfitted for standard 
surface ECG recordings, as described above. The anterior epicardium was then 
exposed by a wide thoracotomy and visualized using an epifluorescence stereo- 
microscope (Nikon, SMZ 1000) equipped with an EXFO X-Cite illumination 
source. GCaMP3 was excited at 450 to 490 nm and bandpass filtered (500 to 
550 nm) before detection by an electron-multiplying, charge-coupled device 
camera (Andor iXon 860 EM-CCD) controlled by Andor Solis software. 
GCaMP3 image acquisition was typically at 80 to 140 frames per second (f.p.s.). 
Signals from the charge-coupled device (CCD) camera and the surface ECG were 
fed through a computer for digital storage and off-line analysis using Andor 
software and custom Matlab scripts (MathWorks). 

Some engrafted hearts were also imaged ex vivo after mechanical arrest to 
eliminate motion artefacts. For these experiments, the heart was collected, rapidly 
mounted on a gravity-fed Langendorff apparatus and then perfused with modified 
Tyrode solution at 37 °C. The epicardial GCaMP3 signal was then recorded 
before and after supplementation of the perfusate with an excitation-contraction 
uncoupler, either 2,3-butanedione monoxime (BDM, 20 mM) or blebbistatin 
(10 μM). The utility of blebbistatin was limited by its substantial blue-green 
fluorescence, which interfered with the acquisition of the GCaMP3 signal, so 
BDM was used in experiments except where otherwise indicated (see 
Supplementary Table 3). All quantitative comparisons were carried out using 
hearts treated with the same uncoupler. The electrical activity of the isolated heart 
was continuously monitored by placing positive and negative ECG leads at the 
base of the right ventricle and left ventricular apex, respectively. Although most 
imaging experiments were carried out under spontaneous conditions, a subset of 
hearts was also imaged during external pacing at rates from 3 to 6 Hz (ML866 
PowerLab 430 Data Acquisition System). Experiments involving pacing at rates of 
greater than 5 Hz were sometimes limited by the occurrence of irreversible 
tachyarrhythmias. 

In a subset of hearts, GCaMP3 activation maps were generated by determining 
the interval between a pacing stimulus (or the host QRS complex) and the initial 
rise in GCaMP3 fluorescence. For this analysis, raw data from the green fluor- 
escence and ECG channels were read into a custom Matlab script for analysis. The 
GCaMP3 fluorescent signal was median filtered in time to reduce noise, smoothed 
with a spatial conical filter, and background subtracted. Regions of graft were user 
defined and then analysed to determine the relative GCaMP3 activation time on 
a per-pixel basis. Relative GCaMP3 activation times for each pixel were then 
averaged over several stimuli to generate GCaMP3 activation maps. 
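
The per-pixel activation-time analysis described above was done with custom Matlab scripts that are not reproduced in the paper. The following Python sketch only illustrates the general idea for a single stimulus: a temporal median filter, subtraction of the trace minimum as background, and detection of the initial rise. The 10% rise threshold, the 3-frame filter width, the function names and the toy traces are all assumptions, and the spatial conical filter and the averaging over several stimuli are omitted for brevity.

```python
from statistics import median
from typing import List, Optional, Sequence


def median_filter(trace: Sequence[float], width: int = 3) -> List[float]:
    """Temporal median filter with a window that shrinks at the edges."""
    half = width // 2
    return [median(trace[max(0, i - half): i + half + 1])
            for i in range(len(trace))]


def activation_delay_ms(trace: Sequence[float], stim_frame: int, fps: float,
                        rise_fraction: float = 0.1) -> Optional[float]:
    """Delay (ms) from the stimulus frame to the first frame at which the
    filtered signal reaches rise_fraction of its amplitude above baseline."""
    smoothed = median_filter(trace)
    baseline = min(smoothed)                 # crude background estimate
    amplitude = max(smoothed) - baseline
    if amplitude <= 0:
        return None                          # flat trace: no transient
    threshold = baseline + rise_fraction * amplitude
    for frame in range(stim_frame, len(smoothed)):
        if smoothed[frame] >= threshold:
            return 1000.0 * (frame - stim_frame) / fps
    return None


def activation_map(stack: Sequence[Sequence[Sequence[float]]],
                   stim_frame: int, fps: float) -> List[List[Optional[float]]]:
    """Apply activation_delay_ms to every pixel; stack[y][x] is a time trace."""
    return [[activation_delay_ms(trace, stim_frame, fps) for trace in row]
            for row in stack]


# Two hypothetical pixels sampled at 100 f.p.s., stimulus at frame 2.
pixel_a = [0, 0, 0, 0, 5, 9, 10, 8, 4, 1]
pixel_b = [0, 0, 0, 0, 0, 0, 5, 9, 10, 8]
print(activation_map([[pixel_a, pixel_b]], stim_frame=2, fps=100.0))
# [[20.0, 40.0]]
```

Because the GCaMP3 transient lags depolarization, delays obtained this way are relative rather than absolute activation times, which is why the text refers to relative GCaMP3 activation times.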
Voltage-mapping studies. The preceding imaging system and experimental 
preparation was also used in optical mapping studies with the potentiometric 
dye RH237. RH237 was selected as our dye of choice, given our past experience 
with this indicator and its reasonable spectral separation from GCaMP3. When 
examined in the same heart, GCaMP3 (green-channel) and RH237 (red-channel) 
fluorescent signals were acquired consecutively by an exchange of filter cubes, not 
simultaneously. 

In brief, cryoinjured hearts with or without 28-day-old GCaMP3-positive 
hESC-CM grafts were mounted on a Langendorff apparatus and loaded with 
RH237 (40 μM) by a bolus injection (in 10 ml buffer) into the aorta. After an 
initial stabilization period, RH237-stained hearts were then imaged during both 
spontaneous and paced rhythms. RH237 was excited at 520 to 550 nm, and 
emitted light was long-pass filtered (670 nm) before detection at 500 f.p.s. using 
the high-speed EM-CCD system mentioned above. The resultant fluorescent 
signals were analysed as described previously. In brief, background fluorescence 
was subtracted and the signal at each pixel was smoothed using a median 
temporal filter and a spatial cone filter. The signal at each pixel was then scaled as a 
percentage of the maximum optical action potential amplitude for all the pixels. 
Although not simultaneously acquired, RH237 and GCaMP3 signals could be 
partially correlated in a subset of paced hearts by aligning repetitive signals relative 
to the stimulating electrode. 

Histology. Histological studies were carried out as detailed previously by our 
group. For immunohistochemistry, we used the primary antibodies detailed 
in Supplementary Table 4, then either fluorescent secondary antibodies (Alexa- 
conjugated, species-specific antibodies from Molecular Probes) or the avidin- 
biotin reaction followed by chromogenic detection (ABC kits from Vector 
Labs). The guinea-pig-specific in situ hybridization probe was generated using 
methods adapted from another study. In brief, guinea-pig genomic DNA was 
amplified using PCR primers reported previously (sense: 
5′-CTCCTGTCCTGCATCCACT-3′; antisense: 5′-GGATATGAGAGACAGTGGTG-3′). The 
resultant 345-bp band was excised and then amplified in a second round of 
PCR doped with digoxigenin-11-dUTP. Subsequent in situ hybridization with 
this digoxigenin-labelled guinea-pig-specific probe was carried out using methods 
detailed previously for the human-specific pan-centromeric probe. The 
hybridized probe was detected using a peroxidase-conjugated anti-digoxigenin 
antibody (Roche), followed either by a chromogenic peroxidase substrate or fluor- 
escent tyramide signal amplification (Molecular Probes). 

Statistical analysis. All data were analysed in a blind manner, with the identifier 
code revealed only after the analysis was completed. All values were expressed as 
mean ± standard error. Statistical analyses were performed using PASW Statistics 
18, GraphPad Prism and SAS software, with the threshold for significance level set 
at P < 0.05. Echocardiographic outcomes were analysed using an analysis of 
variance (ANOVA), then by post-hoc comparisons between groups by Tukey 
HSD, and time-course changes were analysed using paired t-test analysis of means. 
For comparisons of the fraction of animals showing spontaneous or induced 
arrhythmias, we used a two-sided Fisher’s exact test. The incidence of spontaneous 
arrhythmias was analysed using a Poisson regression model to adjust for repeated 
measures and the effect of time. This approach was used to calculate the ratio of the 
observed frequencies of recurrence for the two groups and the 95% confidence 
interval. We used the McNemar’s test for correlated proportions to test for an 
association between induced ventricular tachycardia and spontaneous ventricular 
tachycardia either on the final day of monitoring or throughout days 3 to 28 after 
the transplantation monitoring period. To test for synchrony between GCaMP3 
fluorescence and ECG signals, intervals were compared using ANOVA and then a 
post-hoc Dunnett’s test. 
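
As an illustration of the two-sided Fisher's exact test used for the arrhythmia fractions, the following self-contained Python sketch computes the P value for a 2 × 2 table. The authors used statistical packages, not this code. The counts in the example (1 of 15 versus 8 of 13 animals) are hypothetical: they were chosen only because they reproduce the reported 6.7% and 61.5% induction rates, and the actual group sizes are not stated in this section.

```python
from math import comb


def fisher_exact_two_sided(a: int, b: int, c: int, d: int) -> float:
    """Two-sided Fisher's exact P value for the table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = row1 + row2

    def prob(x: int) -> float:
        return comb(row1, x) * comb(row2, col1 - x) / comb(total, col1)

    p_observed = prob(a)
    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    return sum(
        p for p in (prob(x) for x in range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-9)   # tolerance for floating-point ties
    )


# Hypothetical counts: rows = group, columns = (VT induced, VT not induced).
p_value = fisher_exact_two_sided(1, 14, 8, 5)
print(round(p_value, 4))  # 0.0036
```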

The rapid disruption of tropical forests probably imperils global biodiversity 
more than any other contemporary phenomenon. With 
deforestation advancing quickly, protected areas are increasingly 
becoming final refuges for threatened species and natural ecosystem 
processes. However, many protected areas in the tropics are them- 
selves vulnerable to human encroachment and other environmental 
stresses. As pressures mount, it is vital to know whether existing 
reserves can sustain their biodiversity. A critical constraint in 
addressing this question has been that data describing a broad array 
of biodiversity groups have been unavailable for a sufficiently large 
and representative sample of reserves. Here we present a uniquely 
comprehensive data set on changes over the past 20 to 30 years in 31 
functional groups of species and 21 potential drivers of environ- 
mental change, for 60 protected areas stratified across the world’s 
major tropical regions. Our analysis reveals great variation in 
reserve ‘health’: about half of all reserves have been effective or 
performed passably, but the rest are experiencing an erosion of 
biodiversity that is often alarmingly widespread taxonomically 
and functionally. Habitat disruption, hunting and forest-product 
exploitation were the strongest predictors of declining reserve 
health. Crucially, environmental changes immediately outside 
reserves seemed nearly as important as those inside in determining 
their ecological fate, with changes inside reserves strongly mirroring 
those occurring around them. These findings suggest that tropical 
protected areas are often intimately linked ecologically to their 
surrounding habitats, and that a failure to stem broad-scale loss 
and degradation of such habitats could sharply increase the 
likelihood of serious biodiversity declines. 

Tropical forests are the biologically richest ecosystems on Earth. 
Growing concerns about the impacts of anthropogenic pressures on 
tropical biodiversity and natural ecosystem services have led to 
increases in the number and extent of protected areas across the 
tropics. However, much remains unknown about the likelihood of 
biodiversity persisting in such protected areas. Remote-sensing tech- 
nologies offer a bird’s-eye view of tropical forests and provide many 
important insights, but are largely unable to discern crucial on- 
the-ground changes in forest biodiversity and ecological functioning. 

To appraise both the ecological integrity and threats for tropical 
protected areas on a global scale, we conducted a systematic and 
uniquely comprehensive assessment of long-term changes within 60 
protected areas stratified across the world’s major tropical forest 
regions (Supplementary Fig. 1). To our knowledge, no other existing 
data set includes such a wide range of biodiversity and threat indicators 
for such a large and representative network of tropical reserves. Our 
study was motivated by three broad issues: whether tropical reserves 
will function as ‘arks’ for biodiversity and natural ecosystem processes; 
whether observed changes are mainly concordant or idiosyncratic 
among different protected areas; and what the principal predictors 
of reserve success or failure are, in terms of their intrinsic character- 
istics and drivers of change. 

To conduct our study we amassed expert knowledge from 262 
detailed interviews, focusing on veteran field biologists and environ- 
mental scientists who averaged nearly 2 decades of experience 
(mean ± s.d., 19.1 ± 9.6 years) at each protected area. Each interviewed 
researcher completed a detailed 10-page questionnaire, augmented by a 
telephone or face-to-face interview (see Supplementary Information). 
The questionnaires focused on longer-term (approximately 20- 
30-year) changes in the abundance of 31 animal and plant guilds 
(trophically or functionally similar groups of organisms), which col- 
lectively have diverse and fundamental roles in forest ecosystems 
(Table 1). We also recorded data on 21 potential drivers of environ- 
mental change both inside each reserve and within a 3-km-wide buffer 
zone immediately surrounding it (Table 1). 

Our sample of protected areas spans 36 nations and represents a 
geographically stratified and broadly representative selection of sites 
across the African, American and Asia-Pacific tropics (Supplementary 
Fig. 1). The reserves ranged from 160 ha to 3.6 million ha in size, but 
most (85%) exceeded 10,000 ha in area (median = 99,350 ha; lower 
decile = 7,000 ha; upper decile = 750,000 ha). The protected areas fall 
under various International Union for Conservation of Nature 
(IUCN) reserve classifications. Using data from the World Database 
on Protected Areas (http://www.wdpa.org), we found no significant 
difference (P= 0.13) in the relative frequency of high-protection 
(IUCN Categories I-IV), multiple-use (Categories V-VI) and 

Table 1 | The 31 animal and plant guilds, and the 21 environmental drivers assessed both inside and immediately outside each protected area.

Guilds

Broadly forest-dependent guilds
  Apex predators
  Large non-predatory species
  Primates
  Opportunistic omnivorous mammals
  Rodents
  Bats
  Understory insectivorous birds
  Raptorial birds
  Larger frugivorous birds
  Larger game birds
  Lizards and larger reptiles
  Venomous snakes
  Non-venomous snakes
  Terrestrial amphibians
  Stream-dwelling amphibians
  Freshwater fish
  Dung beetles
  Army or driver ants
  Aquatic invertebrates
  Large-seeded old-growth trees
  Epiphytes

Other functional groups
  Ecological specialists
  Species requiring tree cavities
  Migratory species

Disturbance-favouring guilds
  Lianas and vines
  Pioneer and generalist trees
  Exotic animal species
  Exotic plant species
  Disease-vectoring invertebrates
  Light-loving butterflies
  Human diseases

Potential environmental drivers
  Changes in natural-forest cover
  Selective logging
  Fires
  Hunting
  Harvests of non-timber forest products
  Illegal mining
  Roads
  Automobile traffic
  Exotic plantations
  Human population density
  Livestock grazing
  Air pollution
  Water pollution
  Stream sedimentation
  Soil erosion
  River & stream flows
  Ambient temperature
  Annual rainfall
  Drought severity or intensity
  Flooding
  Windstorms

[Figure 1: histogram of the reserve-health index. Vertical axis, number of reserves; the distribution is divided into ‘suffering’ and ‘succeeding’ reserves.]

Figure 1 | Distribution of the ‘reserve-health index’ for 60 protected areas 
spanning the world’s major tropical forest regions. This relative index averages 
changes in 10 well-studied guilds of animals and plants, including disturbance- 
avoiding and disturbance-favouring groups, over the past 20 to 30 years. 


unclassified reserves between our sample of 60 reserves and all 16,038 
reserves found in the same tropical nations (Supplemen- 
tary Fig. 2). We also found no significant difference (P = 0.08) in the 
geographical isolation of our reserves (travel time to the nearest city 
with greater than 50,000 residents) relative to a random sample of 60 
protected areas stratified across the same 36 nations (Supplementary 
Fig. 3). 

We critically assessed the validity of our interview data by compar- 
ing them to 59 independent time-series data sets in which change in a 
single guild or environmental driver was assessed for one of our 
protected areas. Collectively, our meta-analysis included some data 
on 15 of the guilds, 13 of the drivers and 27 of the protected areas in 
our study (Supplementary Table 1). Most (86.4%) of the independent 
data sets supported our interview results, and in no case did an 
independent test report a trend opposite in sign to our interview-based 
findings. 

Our analyses suggest that the most sensitive guilds in tropical pro- 
tected areas include apex predators, large non-predatory vertebrates, 
bats, stream-dwelling amphibians, terrestrial amphibians, lizards and 
larger reptiles, non-venomous snakes, freshwater fish, large-seeded 
old-growth trees, epiphytes and ecological specialists (all P< 0.0056, 
with effect sizes ranging from −0.36 to −1.05; Supplementary Table 2). 
Several other groups were somewhat less vulnerable, including 
primates, understory insectivorous birds, large frugivorous birds, 
raptorial birds, venomous snakes, species that require tree cavities, 
and migratory species (all P< 0.05, with effect sizes from −0.27 to 
−0.53). In addition, five groups increased markedly in abundance in 
the reserves, including pioneer and generalist trees, lianas and vines, 
invasive animals, invasive plants and human diseases (all P < 0.0056, 
with effect sizes from 0.44 to 1.17). 

To integrate these disparate data, we generated a ‘reserve-health 
index’ that focused on 10 of the best-studied guilds (data for each 
available at ≥80% of reserves), all of which seem to be sensitive to 
environmental changes in protected areas. Six of these are generally 
‘disturbance avoiders’ (apex predators, large non-predatory vertebrates, 
primates, understory insectivorous birds, large frugivorous birds 
and large-seeded old-growth trees) and the remainder seem to be 
‘disturbance-favouring’ groups (pioneer and generalist trees, lianas 
and vines, exotic animals and exotic plants). For each protected area, 
we averaged the mean values for each group, using negative values to 
indicate increases in abundance of the disturbance-favouring guilds. 
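
The averaging described above can be illustrated with a minimal sketch. The guild groupings and the sign flip for disturbance-favouring guilds follow the text; the dictionary layout, the function names and the example values are assumptions made for illustration, and the −0.25 cut-off is the one used to separate ‘suffering’ from ‘succeeding’ reserves in Fig. 2.

```python
from statistics import mean

# Guild groupings as listed in the text; lower-case keys are an assumed convention.
DISTURBANCE_AVOIDERS = {
    "apex predators",
    "large non-predatory vertebrates",
    "primates",
    "understory insectivorous birds",
    "large frugivorous birds",
    "large-seeded old-growth trees",
}
DISTURBANCE_FAVOURERS = {
    "pioneer and generalist trees",
    "lianas and vines",
    "exotic animals",
    "exotic plants",
}


def reserve_health_index(guild_trends):
    """Average the per-guild mean trends (each in [-1, 1]) for one reserve.

    Increases in disturbance-favouring guilds count as negative values.
    Guilds outside the ten index guilds, or missing from the input, are skipped.
    """
    signed = []
    for guild, trend in guild_trends.items():
        if guild in DISTURBANCE_AVOIDERS:
            signed.append(trend)
        elif guild in DISTURBANCE_FAVOURERS:
            signed.append(-trend)
    if not signed:
        raise ValueError("no index guilds with data for this reserve")
    return mean(signed)


def is_suffering(index, threshold=-0.25):
    """Apply the cut-off used to distinguish 'suffering' reserves."""
    return index < threshold


# Hypothetical reserve: predators declining, lianas and exotic plants increasing.
example = {
    "apex predators": -0.5,
    "primates": 0.0,
    "lianas and vines": 0.5,
    "exotic plants": 1.0,
}
index = reserve_health_index(example)
```

In this hypothetical case every contributing guild pushes the index downwards or leaves it unchanged, so the reserve would fall in the ‘suffering’ group.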

The reserve-health index varied greatly among the different pro- 
tected areas (Fig. 1). About four-fifths of the reserves had negative 
values, indicating some decline in reserve health. For 50% of all 
reserves this decline was relatively serious (mean score < −0.25), with 
the affected organisms being remarkable for their high functional and 
taxonomic diversity (Fig. 2). These included plants with varying 
growth forms and life-history strategies, and fauna that differed widely 
in body size, trophic level, foraging strategies, area needs, habitat use 
and other attributes. The remaining reserves generally exhibited 
much more positive outcomes for biodiversity (Fig. 2), although a 
few disturbance-favouring guilds, such as exotic plants and pioneer 
and generalist trees, often increased even within these areas. 

An important predictor of reserve health was improving reserve 
management. According to our experts, reserves in which actual, 
on-the-ground protection efforts (see Supplementary Information) 
had increased over the past 20 to 30 years generally fared better than 
those in which protection had declined, a relationship that was 
consistent across all three of the world’s major tropical regions (Fig. 3). 
Indeed, on-the-ground protection has increased in more than half of 
the reserves over the past 20 to 30 years, and this is assisting efforts to 
limit threats such as deforestation, logging, fires and hunting within 
these reserves (Supplementary Table 3), relative to areas immediately 
outside (Supplementary Table 4). 

However, our findings show that protecting biodiversity involves 
more than just safeguarding the reserves themselves. In many 
instances, the landscapes and habitats surrounding reserves are under 
imminent threat**’> (Fig. 4 and Supplementary Tables 3 and 4). For 
example, 85% of our reserves suffered declines in surrounding forest 
cover in the last 20 to 30 years, whereas only 2% gained surrounding 
forest. As shown by general linear models (Supplementary Table 5), 
such changes can seriously affect reserve biodiversity. Among the 


[Figure 2: paired bar charts for ‘suffering’ and ‘succeeding’ reserves. Horizontal axis in each panel, percentage of reserves worsening (left, to −80) versus improving (right, to 80). Guilds shown: exotic plants; large non-predatory spp.; primates; pioneer and generalist trees; apex predators; exotic animals; large-seeded old-growth trees; lianas and vines; raptorial birds; larger game birds; human diseases; larger frugivorous birds; understory insectivorous birds; cavity-requiring spp.; freshwater fish.]

Figure 2 | Percentages of reserves that are worsening versus improving for key 
disturbance-sensitive guilds, contrasted between ‘suffering’ and ‘succeeding’ 
reserves (which are distinguished by having lower (< −0.25) versus higher 
(≥ −0.25) values for the reserve-health index, respectively). For disturbance- 
favouring organisms such as exotic plants and animals, pioneer and generalist 
trees, lianas and vines, and human diseases, the reserve is considered to be 
worsening if the group increased in abundance. For any particular guild, 
reserves with missing or zero values (no trend) are not included. 
[Figure 3: reserve health plotted against change in reserve protection (−1, worsening, to 1, improving), with separate lines for Africa, the Americas and Asia-Pacific.]

Figure 3 | Effects of improving on-the-ground protection on a relative index of 
reserve health. This positive relationship held across all three tropical 
continents (a general linear model showed that the protection term was the 
most effective predictor of reserve health (Akaike’s information criterion 
weight, 0.595; deviance explained, 11.4%), with the addition of ‘continent’ 
providing only a small improvement in model fit (Akaike’s information 
criterion weight, 0.317; deviance explained, 16.3%)). 


potential drivers of declining reserve health, three of the most import- 
ant predictors involved ecological changes outside reserves (declining 
forest cover, increasing logging and increasing fires outside reserves; 
Supplementary Fig. 6). The remainder involved changes within 
reserves (particularly declining forest cover and increasing hunting, 
as well as increasing logging and harvests of non-timber forest pro- 
ducts; Supplementary Table 5). 

Thus, changes both inside and outside reserves determine their 
ecological viability, with forest disruption (deforestation, logging and 
fires), and overexploitation of wildlife and forest resources (hunting 


Population growth 


Stream sedimentation 
Water pollution [ 
Road expansion ji 


Automobile traffic | 


Population growth 


Stream sedimentation 
Water pollution 
Road expansion 


Automobile traffic 


and harvests of non-timber forest products) having the greatest direct 
negative impacts. Other environmental changes, such as air and water 
pollution, increases in human population densities and climatic 
change (changes in total rainfall, ambient temperature, droughts and 
windstorms) generally had weaker or more indirect effects over the last 
20 to 30 years (Supplementary Table 5). 

Environmental degradation occurring around a protected area 
could affect biodiversity in many ways, such as by increasing reserve 
isolation, area and edge effects’*"’’. However, we discovered that its 
effects are also more insidious: they strongly predispose the reserve 
itself to similar kinds of degradation. Nearly all (19 of 21) of the 
environmental drivers had positive slopes when comparing their 
direction and magnitude inside versus outside reserves (Fig. 5). 
Among these, 13 were significant even with stringent Bonferroni cor- 
rections (P <0.0071) and 17 would have been significant if tested 
individually (P< 0.05). As expected, the associations were strongest 
for climate parameters but were also strong for variables describing air 
and water pollution, stream sedimentation, hunting, mining, harvests 
of non-timber forest products and fires. To a lesser extent, trends in 
forest cover, human populations, road expansion and automobile traffic 
inside reserves also mirror those occurring outside reserves (Fig. 5). 
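
As an illustration of the inside-versus-outside comparison, the sketch below computes a Pearson correlation coefficient for one driver across reserves. The per-reserve trend values are hypothetical, the function name is ours, and the significance test used to obtain the P values reported above is not reproduced here.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 values)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical mean trends (-1 to 1) for one driver, one value per reserve.
hunting_inside = [-1.0, -0.5, 0.0, 0.5, 0.0]
hunting_outside = [-1.0, -0.75, -0.25, 0.25, -0.5]
r = pearson_r(hunting_inside, hunting_outside)
```

A positive coefficient for a driver means that reserves with stronger change outside also tend to show stronger change of the same sign inside.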

Our findings signal that the fates of tropical protected areas will be 
determined by environmental changes both within and around the 
reserves, and that pressures inside reserves often closely reflect those 
occurring around them. For many reasons, larger reserves should be 
more resilient to such changes’*~’, although we found that removing 
the effects of reserve area statistically did not consistently weaken the 
correlations between changes inside versus outside protected areas 
(Supplementary Table 6). 

Our study reveals marked variability in the health of tropical pro- 
tected areas. It indicates that the best strategy for maintaining biodi- 
versity within tropical reserves is to protect them against their major 
proximate threats, particularly habitat disruption and overharvesting. 
However, it is not enough to confine such efforts to reserve interiors 
while ignoring their surrounding landscapes, which are often being 
rapidly deforested, degraded and overhunted**"*"* (Fig. 5). A failure to 
limit interrelated internal and external threats could predispose 
reserves to ecological decay, including a taxonomically and functionally 


[Figure 4: an image of a reserve boundary alongside paired bar charts for conditions inside and outside reserves. Horizontal axis, percentage of reserves worsening (left, to −100) versus improving (right, to 100). Drivers shown: forest cover; logging; fires; soil erosion; population growth; stream sedimentation; water pollution; road expansion; automobile traffic.]

Figure 4 | Comparison of ecological changes inside versus outside protected areas, for selected environmental drivers. The image is an example of the strong 
distinction in disturbance inside versus outside a reserve. The bars show the percentages of reserves with improving versus worsening conditions. 
[Figure 5: horizontal bar chart of correlation coefficients (r, from −0.2 to 0.8), with bars shaded by significance (P < 0.0071; P < 0.05; nonsignificant). Drivers shown, from top to bottom: temperature; rainfall; windstorm disturbance; flooding; droughts; air pollution; water pollution; stream sedimentation; hunting; river flows; illegal mining; NTFP harvests; fires; automobile traffic; forest cover; population growth; road expansion; soil erosion; selective logging; exotic-tree plantations; livestock grazing.]

Figure 5 | Pearson correlations comparing the direction and strength of 21 
environmental drivers inside versus outside tropical protected areas. NTFP, 
non-timber forest products. 


sweeping array of changes in species communities (Fig. 2) and an 
erosion of fundamental ecosystem processes'*"*”*. 

Protected areas are a cornerstone of efforts to conserve tropical 
biodiversity**!*"". It is not our intent to diminish their crucial role 
but to highlight growing challenges that could threaten their success. 
The vital ecological functions of wildlife habitats surrounding protected 
areas create an imperative, wherever possible, to establish sizeable 
buffer zones around reserves, maintain substantial reserve connectivity 
to other forest areas and promote lower-impact land uses near reserves 
by engaging and benefiting local communities*"***~”. A focus on man- 
aging both external and internal threats should also increase the resi- 
lience of biodiversity in reserves to potentially serious climatic 
change**~ in the future. 


METHODS SUMMARY 


Our interview protocol, rationale, questionnaire and data analyses are detailed in 
the Supplementary Information. We selected protected areas broadly to span the 
African, American and Asia-Pacific tropics (Supplementary Fig. 1), focusing on 
sites with mostly tropical or subtropical forest that had at least 10 refereed pub- 
lications and 4-5 researchers with long-term experience who could be identified 
and successfully interviewed. 

We devised a robust and relatively simple statistical approach to assess temporal 
changes in the abundance of each guild and in each potential environmental driver 
across our reserve network (see Supplementary Information). In brief, this involved 
asking each expert whether each variable had markedly increased, remained stable 
or markedly declined for each reserve. These responses were scored as 1, 0 and −1, 
respectively. For each response, the expert was also asked to rank their degree of 
confidence in their knowledge. After discarding responses with lower confidence, 
scores from the individual experts at each site were pooled to generate a mean value 
(ranging from −1.0 to 1.0) to estimate the long-term trend for each variable. 
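
A minimal sketch of this scoring step is given below. The scores of 1, 0 and −1 follow the text; the numeric confidence scale, the `min_confidence` cut-off and the answer labels are hypothetical, because the actual ranking scheme is described only in the Supplementary Information.

```python
from statistics import mean

# Scores from the text: marked increase, stable, marked decline.
SCORES = {"increased": 1, "stable": 0, "declined": -1}


def site_trend(responses, min_confidence=2):
    """Pool expert responses for one variable at one site.

    responses: list of (answer, confidence) pairs, where confidence is a
    hypothetical integer rank (higher means more confident).
    Returns the mean score in [-1, 1], or None if no response is retained.
    """
    kept = [SCORES[answer] for answer, confidence in responses
            if confidence >= min_confidence]
    if not kept:
        return None
    return mean(kept)


# Four hypothetical experts reporting on one guild at one reserve.
responses = [("declined", 3), ("declined", 2), ("stable", 3), ("increased", 1)]
trend = site_trend(responses)
```

Here the low-confidence ‘increased’ response is discarded before averaging, so the pooled value reflects only the three retained answers.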

The means for each variable across all 60 sites were then pooled into a single data 
distribution. We used bootstrapping (resampling with replacement; 100,000 itera- 
tions) to generate confidence intervals for the overall mean of the data distribution. If 
the confidence intervals did not overlap zero, then we interpreted the trend as being 
non-random. Because we tested many different guilds, we used a stringent Bonferroni 
correction (P = 0.0056) to reduce the likelihood of Type I statistical errors, although 
we also identified guilds that showed evidence of trends (P = 0.05) if tested individu- 
ally. For comparison, we estimated effect sizes (bootstrapped mean divided by s.d., 
with negative values indicating declines) for changes in guild abundances and for 
potential drivers inside and outside reserves (Supplementary Tables 2-4). 
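
The bootstrap procedure can be sketched as follows. Resampling with replacement and dividing the bootstrapped mean by the s.d. follow the text; the percentile form of the confidence interval, the use of the sample s.d. of the site means, the fixed random seed and the example values are assumptions made for illustration.

```python
import random
from statistics import stdev


def bootstrap_mean_ci(values, iterations=100_000, alpha=0.05, seed=0):
    """Bootstrapped mean and percentile confidence interval for the mean.

    Returns (bootstrapped mean, lower bound, upper bound).
    """
    rng = random.Random(seed)
    n = len(values)
    # Each iteration resamples n values with replacement and records the mean.
    boot_means = sorted(
        sum(rng.choices(values, k=n)) / n for _ in range(iterations)
    )
    lower = boot_means[int(alpha / 2 * iterations)]
    upper = boot_means[int((1 - alpha / 2) * iterations) - 1]
    boot_mean = sum(boot_means) / iterations
    return boot_mean, lower, upper


def effect_size(values, boot_mean):
    """Bootstrapped mean divided by the s.d.; negative values indicate declines."""
    return boot_mean / stdev(values)


# Hypothetical site means for one guild; a real analysis would use all 60 sites.
site_means = [-0.5, -0.4, -0.6, -0.3, -0.7, -0.2]
boot_mean, lower, upper = bootstrap_mean_ci(site_means, iterations=2000)
trend_is_nonrandom = not (lower <= 0.0 <= upper)
```

Passing `alpha=0.0056` instead of 0.05 would correspond to the Bonferroni-corrected threshold mentioned above.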
Activity in motor-sensory projections reveals 
distributed coding in somatosensation 


Leopoldo Petreanu, Diego A. Gutnisky, Daniel Huber, Ning-long Xu, Dan H. O’Connor, Lin Tian, Loren Looger 
& Karel Svoboda 


Cortical-feedback projections to primary sensory areas terminate 
most heavily in layer 1 (L1) of the neocortex '”, where they make 
synapses with tuft dendrites of pyramidal neurons. L1 input is 
thought to provide ‘contextual’ information’, but the signals trans- 
mitted by L1 feedback remain uncharacterized. In the rodent 
somatosensory system, the spatially diffuse* feedback projection 
from vibrissal motor cortex (vM1) to vibrissal somatosensory 
cortex (vS1, also known as the barrel cortex) may allow whisker 
touch to be interpreted in the context of whisker position to 
compute object location®®. When mice palpate objects with their 
whiskers to localize object features”*, whisker touch excites vS1° 
and later vM1 in a somatotopic manner’®'*. Here we use axonal 
calcium imaging to track activity in vM1→vS1 afferents in L1 of 
the barrel cortex while mice performed whisker-dependent object 
localization. Spatially intermingled individual axons represent 
whisker movements, touch and other behavioural features. In a 
subpopulation of axons, activity depends on object location and 
persists for seconds after touch. Neurons in the barrel cortex thus 
have information to integrate movements and touches of multiple 
whiskers over time, key components of object identification and 
navigation by active touch. 

We trained head-fixed mice to perform a whisker-based object- 
localization task under a two-photon microscope with one whisker 
row (C row) (Fig. 1a). In each trial, a pole was moved into one of 
several locations within reach of the whiskers (sampling period, 
approximately 1 s; Supplementary Fig. 1a), arranged along the 
anterior–posterior axis on one side of the head (Fig. 1b). Mice moved 
their whiskers to determine whether the pole was either in one of 
several ‘go’ locations or in a ‘no-go’ location. Automated whisker 
tracking'* measured whisker movements and shape. The force acting 
on the follicles, which underlies object localization*!*'*®, can be 
estimated from whisker shape changes induced by touch. A change 
in curvature at point p of the whisker is proportional to the force 
applied by the pole on the whisker: $F \propto \Delta\kappa_p\,\gamma_p$, where $\gamma_p$ is 
the bending stiffness at p. We thus present forces acting on the 
whiskers as the change in curvature, Δκ (Supplementary Fig. 1b). 
Mice report their decision about object location with licking. 
Imaging was performed in trained mice (discriminability (d′) average, 

Figure 1 | Imaging activity in vM1→vS1 axons during whisker-based object 
localization. a, Head-fixed mouse performing an object-localization task 
under the microscope. b, One frame of a high-speed video sequence and typical 
pole positions corresponding to multiple go locations (hues of blue) and a single 
no-go location (red). Whisker touch was possible for all object locations for the 
long posterior whiskers. The azimuthal angle @ describes whisker position (0, 
denotes azimuthal angle at first touch). c, vM1 neurons were infected with 
GCaMP3-expressing virus and their axons were imaged in L1 of vS1. d, Intrinsic 
optical-imaging signals corresponding to deflections of whiskers C1, C2 and C3, 
overlaid on a brightfield image of the vasculature. e, Two-photon images 
showing vM1 axons in the C1 barrel column. f, Field of view used for in vivo 
imaging showing varicosities (arrowheads correspond to varicosities used for 
fluorescence time series in g). g, Fluorescence dynamics over multiple 
behavioural trials. Coloured bars indicate the sampling period, when the pole is 
within reach (blue, go trials; red, no-go trials). h, Matrix of correlation 
coefficients for activity in different varicosities in one field of view. Arrowheads 
correspond to varicosities marked in f. i, Distribution of correlation coefficients 
for all fields of view. Black line denotes pairs of varicosities on the same axon (78 
varicosities, 31 axons) and grey line denotes all other varicosity pairs (51,325 
varicosities). Correlation coefficients were computed over the entire session. 
j, Correlation coefficients as a function of mean activity level (black denotes 
pairs of varicosities on the same axon and grey denotes all other pairs). The solid 
line is an exponential fit ($y = a(1 - e^{-bx}) + c$) to the black circles and the 
dashed lines represent the 95% confidence interval. 

1.8, corresponding to 80% correct trials; range, 1.2-2.6) (Supplementary 
Fig. 1c-e). 

To image activity in vM1→vS1 axons we expressed the genetically 
encoded calcium indicator GCaMP3 (ref. 17) in vM1 neurons and 
implanted a chronic imaging window over vS1 (ref. 18) (Fig. 1c). 
Imaging was in barrel columns with principal whiskers C1, C2 or 
C3, based on intrinsic signal imaging (Fig. 1d). Fluorescent axons 
were abundant in L1 (Fig. 1e, f and Supplementary Fig. 2a, b). 
GCaMP3-positive axonal varicosities appeared as bright spots along 
the axon backbone. Two-photon time-lapse images (field of view, 
40 µm × 40 µm) were acquired using raster scanning (16 Hz) over 
100-200 trials per imaging location (trial duration, 7 s) (Fig. 1f, g 
and Supplementary Fig. 1c, d). 

Action potentials reliably invade cortical axonal arborizations and 
cause calcium accumulations in varicosities’, which harbour synaptic 
terminals. We measured GCaMP3-mediated fluorescence transients, 
corresponding to trains of action potentials (=5; Supplementary 
Fig. 2c-f), in regions of interest (ROIs) containing individual 
varicosities (range, 35-80 varicosities per field of view; 17 fields of 
view; 6 mice). Fluorescence transients were detected in multiple 
ROIs within each field of view (range, 6-20 active varicosities per field 
of view) (Figs 1f and 2a). As expected, the fluorescence signals from 
varicosities belonging to the same axon were highly correlated, 
whereas correlations computed across all other pairs were low 
(Fig. 1g). These correlations allowed us to identify varicosities on the 
same axon, even in cases where the local arborization could not be 
reconstructed morphologically (Fig. 1h-j). Below we report the activity 
of one varicosity per axon to represent the activity of the entire axon. 
Only active axons were analysed (Methods). This ensured that almost 
all (>99%) fluorescence signals reported neural activity rather than 
movement (Supplementary Fig. 3). 
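
One way to picture the correlation-based assignment of varicosities to axons is the sketch below, which links any two varicosities whose fluorescence traces correlate above a threshold and then reads off the connected groups. The threshold of 0.7, the function names and the toy traces are hypothetical; this is an illustration of the idea rather than the procedure described in the Methods.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation of two equal-length traces (0.0 if either is flat)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def group_varicosities(traces, threshold=0.7):
    """Group varicosity indices whose traces are highly correlated.

    Uses a small union-find structure so that chains of correlated
    varicosities end up in the same putative axon.
    """
    parent = list(range(len(traces)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for i in range(len(traces)):
        for j in range(i + 1, len(traces)):
            if pearson_r(traces[i], traces[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(len(traces)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Toy traces: the first two rise and fall together, the third does not.
traces = [
    [0, 1, 0, 2, 0, 3],
    [0, 2, 0, 4, 0, 6],
    [1, 0, 2, 0, 1, 0],
]
putative_axons = group_varicosities(traces)
```

Each group would then contribute a single representative varicosity to later analyses, in line with reporting one varicosity per axon.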

We aligned fluorescence time series with recordings of behavioural 
features, including Δκ, whisker movements (whisking) and lick rate, 
and grouped trials by trial type (Fig. 2). Axonal activity was concentrated 
around the sampling period, and correlated with specific behavioural 
variables. Some axons were active coincident with whisking 
before and during the sampling period (Fig. 2b-d; compare ROI1 and 
whisking amplitude), whereas other axons were mainly active in trials 
with strong whisker touch (compare ROI2 and Δκ). Some axons were 
primarily active during licking or could not obviously be explained by a 
single behavioural feature. Correlations between activity and behaviour 
were apparent across trials and within trials. These data indicate 
that individual vM1→vS1 axons represent specific behavioural 
features, but the representations are diverse across the population of 
axons. 

We used a generalized form of regression (Random Forests, 
Methods) to quantify how behavioural features are represented by 
vM1→vS1 axons. Both behaviour and activity varied within trials, 
across individual trials and across trial types (Fig. 2b, c). For example, 
in some trials whisker touch was strong, whereas in other trials it was 
weak or absent (touch in no-go trials was typically the weakest; 
Supplementary Fig. 1b, e). In some trials mice whisked gingerly and 
late during the sampling period (Fig. 2b, trial 144), whereas in other 
trials they whipped their whiskers against the pole as soon as it came 
within reach (Fig. 2b, trial 145). Our algorithms exploit this variability 
and the large number of trials in our data set (Supplementary Fig. 1d) 
Figure 2 | Motor and sensory signals in vM1→vS1 axons. a, Image of the 
field of view and ROIs (red) corresponding to all visible axonal varicosities. 
Yellow numbers indicate ROI1 and ROI2. b, Activity (ΔF/F₀) of two individual 
axons (ROI1 and ROI2) across three trials of one behavioural session, aligned 
with different behavioural features (whisker position, whisking set-point, 
whisking amplitude, Δκ, licking (ticks)). Protraction forces correspond to 
negative Δκ. Coloured bars indicate the sampling period (blue, go trials; red, 
no-go trials). c, Activity across an entire session for two axons (ΔF/F₀ ROI1; ΔF/F₀ 
ROI2). Each row corresponds to a trial. Dashed white lines indicate the sampling 
period when the pole is within reach. Trial type is indicated on the far left (hues of 
blue correspond to different object locations in go trials, as indicated in Fig. 1b). 
Only correct trials are shown. Right, behavioural variables (see Methods). Δκ 
due to protractions; retractions are not shown. Whisking amplitude is the 
amplitude of the band-pass filtered (6-30 Hz) whisker angle. Lick rate is the 
instantaneous rate of tongue protractions. d, Activity (left) and behavioural 
variables (right) averaged across trial types. Shading represents s.e.m. 
to quantify how well specific behavioural features could be decoded 
from axonal activity. 

We first decoded the measured behavioural features (Δκ, whisking 
and licking; Methods) on the basis of activity in all axons in a field of 
view (Fig. 3a). The algorithms used the activity of populations of axons 
to fit individual behavioural features, taking into account variability 
within and across trials. The explained variance ($R_i^2$ for the ith behavioural 
feature) was used to measure the quality of decoding 
(Supplementary Fig. 4). 
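
For orientation, explained variance can be computed from a measured behavioural time series and a model prediction as sketched below. The formula is the standard coefficient of determination, which we assume matches the definition used here; the Random Forests fit itself is not shown, and the example series are hypothetical.

```python
def r_squared(measured, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("need two non-empty series of equal length")
    mean_measured = sum(measured) / len(measured)
    ss_total = sum((m - mean_measured) ** 2 for m in measured)
    ss_residual = sum((m - p) ** 2 for m, p in zip(measured, predicted))
    if ss_total == 0:
        raise ValueError("explained variance is undefined for a constant series")
    return 1.0 - ss_residual / ss_total


# Hypothetical lick-rate samples and a decoder's prediction of them.
lick_rate = [0.0, 1.0, 4.0, 6.0, 2.0, 0.0]
decoded = [0.5, 1.5, 3.0, 5.0, 2.5, 0.5]
quality = r_squared(lick_rate, decoded)
```

A value near 1 indicates that the prediction tracks the behavioural feature closely, whereas a value near 0 means it does no better than predicting the mean.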

Population activity decoded the recorded behavioural features, 
including Δκ, whisking and lick rate (Fig. 3a). Whisking was decomposed 
into set-point (<1.6 Hz) and amplitude (6-30 Hz; Methods) 
(Fig. 2b), but these parameters were highly correlated (Supplementary 
Fig. 5a) and decoding was similar for these parameters. All imaged 
fields of view decoded all behavioural features (R² larger than for 
trial-shuffled data; P< 0.001, bootstrap test for all sessions and features 
(Supplementary Fig. 4a)). This is despite the fact that each experiment 
sampled only approximately 1 × 10⁻⁵% of the synapses in vS1 
(assuming: thickness of L1 in vS1, 0.09 mm (ref. 22); vS1 area, 2 mm²; 
synaptic density, 1 × 10⁹ per mm³ (refs 3 and 23)). We conclude that 
vM1→vS1 axons relay signals related to touch, whisking, licking and 
other task-related variables to vS1 in a distributed and highly redundant 
manner. 
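
The order of magnitude of this sampling estimate can be checked with a short calculation. The L1 thickness and vS1 area are the values quoted above; the synaptic density of 1 × 10⁹ per mm³ is our reading of the quoted figure, and the number of varicosities treated as sampled synapses is a hypothetical input, so the result should be taken as a rough consistency check only.

```python
L1_THICKNESS_MM = 0.09      # thickness of L1 in vS1, as quoted in the text
VS1_AREA_MM2 = 2.0          # vS1 area, as quoted in the text
SYNAPSES_PER_MM3 = 1e9      # assumed reading of the quoted synaptic density


def l1_synapse_count():
    """Approximate number of synapses in L1 of vS1 (volume times density)."""
    return L1_THICKNESS_MM * VS1_AREA_MM2 * SYNAPSES_PER_MM3


def percent_sampled(n_varicosities):
    """Percentage of L1 synapses represented by the imaged varicosities."""
    return 100.0 * n_varicosities / l1_synapse_count()


# Hypothetical experiment with 18 active varicosities in the field of view.
fraction = percent_sampled(18)
```

With these inputs, a few tens of varicosities correspond to a sampled fraction on the order of 10⁻⁵%.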

We used similar methods to classify individual axons on the basis of 
their activity patterns’* (Methods; Fig. 3b-e). One half of the active 
axons (78 out of 138 active axons; 56%) decoded one or more of the 


Figure 3 | Decoding behavioural variables on the 
basis of axonal activity. a, Decoding on the basis 
of axon populations. Time series of behavioural 
features down-sampled to 16 Hz (black) and the 
Random Forests model prediction (pink) based on 
the activity of all axons in one field of view. Vertical 
bars indicate the sampling periods and the trial 
protractions (whisker C2), a measure of contact 
force. Centre, whisking amplitude. Bottom, lick 
rate. b-e, Decoding behavioural features on the 


basis of the activity of individual axons. Axons were 
classified based on correlations with behavioural 
features (138 axons, 6 animals, 17 sessions). 

b, Peristimulus time histograms averaged across 
axons with shared activity patterns and trial types, 
in standardized units (z-score). Hits (blue) are 
correct go trials, correct rejections (red) are correct 
no-go trials, and false alarms (green) are incorrect 
no-go trials. Incorrect go trials were rare and are 
not shown. Correct trials and error trials were used 
separately in classification. N, number of animals; 
n, number of axons. 1, touch axons; 2, touch axons 
with persistent activity; 3, mixed axons; 4, whisking 
axons, late activity; 5, whisking axons, early activity; 
6, licking axons; 7, axons that are selectively active 
during false alarms; _ these were unclassified using 
the behavioural features used. Grey shading 
represents s.e.m. c, Rows correspond to axons. 
Task-aligned activity was averaged over trials of 
each type. Rows were ordered on the basis of 
clustering by activity pattern (Methods). 

d, Correlation between the Random Forests model 
and different behavioural features ($R_i^2$) after 
classification (Methods). The hue indicates the 
strongest correlation with one of the features (cyan, 
touch; magenta, mixed; green, whisking; red, 
licking). Black and red arrowheads indicate axons 
showing object-location-dependent activity. Red 
arrowheads indicate significantly persistent axons 
(Fig. 4). e, Axons decoding different behavioural 
features were spatially intermingled (three sessions 
from three different animals). Left panel, same 
axons as in Fig. 2a; right panel, same axons as in 
Fig. 1f.
measured behavioural features (mean R² for best feature, 0.22).
Whisking-related axons (26 out of 138 active axons) were active during
whisking for all trial types, independent of touch (Fig. 2; ROI).
Whisking-related fluorescence signals lagged whisking onset (mean
± s.d., 259 ± 101 ms) (Supplementary Fig. 5b-d). However, because
of the slow dynamics of calcium-dependent fluorescence we cannot 
exclude the possibility that the underlying spikes might lead whisking. 
Coding of whisking amplitude and whisking set-point is consistent 
with neurophysiological* and imaging experiments in vM1 (ref. 12). 
Touch-related axons (31 out of 138 active axons) were activated 
mainly in trials with touch, but not by whisking alone (Fig. 4b; 
Supplementary Fig. 4c). Consistently, trimming whiskers, which
Figure 4 | Persistent object-location-dependent activity. a, Activity of an
example axon for different object locations (hues of blue on the left; compare 
with Fig. 1b; only correct go trials are shown). Activity was aligned to the start of 
the trial (vertical dashed lines indicate presence of the pole). b, Activity 
averaged across trials with touch (black) and without touch (grey) aligned to 
start of trial. c, Same data as in a, aligned to the first touch (vertical line). Ticks, 
last touch in each trial. Bottom, activity averaged across object locations, 
aligned to first touch (hits). Grey shading indicates s.e.m. d, Same data as in 
a, aligned to the last touch. Bottom, activity averaged across object locations, 
aligned to last touch. e, Activity as a function of θ at touch. Same axon as in a is
shown. For averaging, points were grouped by object location. Black line
denotes linear regression (R², 0.41; P = 2.5 × 10°). Error bars denote s.d.

f, Activity averaged across object locations, aligned to the last touch and 
deconvolved to correct for the dynamics of calcium and GCaMP3 fluorescence 
(same axon as in a) (see Supplementary Fig. 8). g, Decay time of fluorescence 
after last touch for all axons showing object-location-dependent activity 

(n = 25). The dashed vertical line indicates τ1/2 for GCaMP3 fluorescence
without persistent activity. h, Average fluorescence signals for axons showing 
persistent activity (n = 11). Activity was normalized to the value at the time of 
the last touch. i, The fraction of trials with correctly decoded object location as a 
function of time for the example axon shown in a-d. Dashed line indicates 
chance level (0.25, corresponding to four object locations). Horizontal line 
indicates time of significant decoding (*P < 0.05). Grey shading indicates 
s.e.m. j, Decoding of object location (same group as in h; black line, average; red 
line, fraction of axons decoding above chance level).
precluded touch during whisking, abolished activity of touch-related
axons (Supplementary Fig. 6). Other axons decoded lick rate (classified 
as lick related)”° or a mixture of features (classified as ‘mixed’ axons) 
(Fig. 3d). 

Axons representing different behavioural features were spatially 
intermingled over length scales of micrometres (Figs 2 and 3e); nearby 
varicosities were equally likely to be part of any of the representations. 
Repeated imaging experiments showed that the representations of 
individual axons were largely stable (Supplementary Fig. 6). The L1 
apical tuft dendrites of individual neurons in vS1 may have access to a 
rich amalgam of contextual information. 

Most of the touch-related axons showed activity that varied with
object location (Fig. 3d, arrowheads; Fig. 4a-c and Supplementary
Fig. 7) (25 out of 138 active axons). These axons became active only
after first touch (Fig. 4a-c) (latencies greater than 100 ms) and might
thus be driven in part by touch-triggered excitation spreading from vS1 
to vM1 (refs 10, 11 and 13). In many cases, activation of these axons 
correlated with forces on a specific whisker. However, this whisker was 
typically not the principal whisker of the imaged barrel column 
(Supplementary Fig. 7). Touch-related activity corresponding to 
specific whiskers is thus broadcast widely to the barrel cortex, consistent
with the diffuse nature of the vM1→vS1 projection*. These
data implicate the primary motor cortex in somatosensation”’.
Furthermore, signals coding for object location are fed back from the 
motor cortex to the somatosensory cortex. 

Fluorescence changes coding for object location often outlasted the
presence of the pole, sometimes for several seconds (Fig. 4d-h and
Supplementary Fig. 8). This is much longer than expected from the
decay-time constant of GCaMP3 fluorescence after trains of action
potentials in axons (half-decay time, τ1/2, 0.45 s; Supplementary Fig. 2)
(Fig. 4g, h). After taking calcium dynamics and GCaMP3 fluorescence
into account, significant graded, persistent activity was seen in 11 out of
25 object-location-sensitive axons (two-way mixed analysis of variance
(ANOVA), P < 0.05), lasting from 0.5 s to more than 4 s (Fig. 3d, red
arrowheads; Fig. 4g, h). Motor behaviours did not explain
object-location-dependent persistent activity; licking and whisking were
not different across trials with different object locations (two-way
mixed ANOVA, 9 out of 11 persistent object location axons,
P > 0.05; Supplementary Fig. 8f, g). Object location could be decoded
from the activity of persistent axons for up to 4 s after the last contact
(Fig. 4i, j). This parametric persistent activity’ represents a form
of short-term memory of objects detected by specific whiskers at 
particular locations. 

We used calcium imaging with genetically encoded indicators to 
track activity in specific projections (Fig. 1). Small patches of L1 in vS1 
receive diverse signals from vM1, including activity coding for aspects 
of whisking, touch and licking (Figs 2 and 3). Most of the signals coded 
by L2/3 neurons in vM1 (ref. 12) were also represented in vM1 axons in
L1 of the vS1. Touch-related and persistent activity was overrepresented
in the vM1→vS1 axons, but this difference could be due to
variations in the behavioural task or the sampled neuronal population. 

Pyramidal neurons receive this rich top-down information in their 
tuft branches in L1, whereas bottom-up sensory input impinges 
mainly on the proximal basal dendrites”. L1 input increases neuronal 
gain and can promote bursting with coincident input in the proximal 
basal dendrites”. Pyramidal neurons therefore compare contextual 
input and ongoing sensory input. 

Activity related to whisker position was a prominent component of
the vM1→vS1 signals (Figs 2 and 3). Pyramidal neurons in vS1 might
combine this efference copy-like positional information and touch 
input to compute object location in vS1 (refs 6 and 15). Whisking 
signals impinging on L1 dendrites might also selectively amplify 
activity related to touch during periods of active exploration. 

Although mice responded within approximately 100 ms after touch
in the object-localization task, a subpopulation of vM1→vS1 axons
showed persistent object-location signals that outlasted the decision of
the animals by seconds (Fig. 4). These memory traces, which might
not be relevant to solve this particular task, represent information 
about touches of specific whiskers in the recent past. Haptic object 
recognition is constructed from localization of several related object 
features, based on multiple touches over time”*’? with multiple 
whiskers*’. The vM1→vS1 projection links the past state of specific
whiskers (input in L1) with present sensory input from other whiskers,
ascending into vS1 through the sensory thalamus. The vS1↔vM1 loop
thus has the capacity to integrate haptic information across time and 
space, key components of object recognition and somatosensory 
navigation. Pyramidal cells in vS1 receive L1 inputs related to past 
touch, whereas their proximal basal dendrites receive input signalling 
present touch. These neurons could selectively burst after specific 
sequences of touches with different whiskers, within time windows 
defined by the mnemonic activity in L1. In this scheme burst firing of 
populations of vS1 neurons codes for complex object shape 
(Supplementary Fig. 9). 


METHODS SUMMARY 


Virus expressing GCaMP3 (rAAV-synapsin (SYN)-GCaMP3, serotype 2/1, 
University of Pennsylvania Gene Therapy Program Vector Core) was injected 
(20 nl, 10 nl min⁻¹, 350 µm deep) into the left vM1 of adult (>P60) C57BL/6 male
mice and a circular craniotomy was made over the ipsilateral vS1. Animals were 
trained in a whisker-dependent object-localization task. vM1 axons were imaged 
in the vS1 14-23 days after virus injection. Images of the whiskers were acquired 
using a high-speed camera at 500 frames per second. The whisker position 
(azimuthal angle at the whisker base, θ) and whisker shape were tracked using
automated whisker tracking* '*. Licking was detected using an optical lickport. 


Full Methods and any associated references are available in the online version of 
the paper.
METHODS


Virus injection and chronic-window preparation. All procedures were 
approved by the Janelia Farm Research Campus Institutional Animal Care and 
Use Committee. Surgeries were conducted on adult (>P60) male C57BL/6 mice 
under isoflurane anaesthesia (1.5-2%). Other drugs reduced potential inflammation
(Ketofen, 5 mg kg⁻¹, subcutaneously) and provided local (Marcaine,
0.5%, injected under the scalp) and general analgesia (buprenorphine, 0.1 mg
kg⁻¹, intraperitoneally). To label motor cortex axons a small craniotomy was
performed over the left vM1 (coordinates relative to bregma, anterior, 1.3 mm; 
lateral, 0.7mm). Virus expressing GCaMP3 (rAAV-synapsin(SYN)-GCaMP3, 
serotype 2/1, University of Pennsylvania Gene Therapy Program Vector Core) 
was injected (20 nl, 10 nl min⁻¹, 350 µm deep) with a custom-made volumetric
injection system (based on a Narishige MO-10 manipulator)’. Glass pipettes
(Drummond) were pulled and bevelled to a sharp tip (30 µm outer diameter).
Pipettes were back-filled with mineral oil and front-loaded with viral suspension 
immediately before injection. Compared to synthetic calcium indicators, expres- 
sion of protein sensors ensures labelling of long-range axons and also allows 
imaging in trained mice across multiple behavioural sessions, separated by days 
to weeks. The virus infected neurons in layers 2-5, including most vS1-projecting 
neurons'* (Supplementary Fig. 2a). A custom-machined titanium frame was 
cemented to the skull with dental acrylic (Lang Dental). 

For imaging, a circular craniotomy was made over the left barrel cortex (vS1;
diameter, 1.5 mm; centre relative to bregma, lateral, 3.5 mm; anterior, −1.4 mm;
left hemisphere, Fig. 1d). An imaging window was constructed from two layers of
standard microscope coverglass (Fisher; thickness, 170-210 µm), joined with an
ultraviolet curable optical glue (NOR-61, Norland). A larger piece was attached to
the bone and a smaller insert fit snugly into the craniotomy. The bone surrounding
the craniotomy was thinned to allow for a flush fit between the insert and 
underlying dura. The window was cemented in place using dental acrylic (Lang 
Dental). After 3 days of recovery the C-row whiskers were mapped using intrinsic 
signal imaging’ (Fig. 1d). Mice were anaesthetized with isoflurane (1%) after
injection of chlorprothixene (1 mg kg⁻¹)*". Images were acquired through the
cranial window using a Leica MZ12.5 microscope under 630 nm illumination.
Images of the vasculature over the same field of view were taken under 530 nm
light-emitting diode illumination. Individual whiskers were moved using a
piezoelectric bimorph (0.75 mm, 3 mm from the base; 10 Hz; 4 s, repeated
every 20 s; total time, 10-30 min). Acquisition was performed with Ephus
(http://www.ephus.org)**. Water restriction was started 1 day later.
Behaviour. The behavioural task was modified from previous studies*'* (Fig. 1a, b
and Supplementary Fig. 1). Behavioural training began after the mice had 
restricted access to water for at least 7 days (1 ml per day). In the initial session 
mice first learned to lick for water rewards (~40 rewards). Animals were then 
trained to lick only in trials in which a metal pole was within easy reach of the 
whiskers on the right side of their face. The pole was mounted on a pneumatic 
linear slider (Festo; SLS-10-30-P-A Mini slide; P/N 170496) and was moved 
rapidly (~0.25 s) in and out of reach of the whiskers under computer control.
The pole was within reach of the whiskers for ~1 s (sampling period). Tongue
movements were tracked using an optical lickport. 

Because the pole was placed in one of five randomly selected locations, it was 
impossible for mice to position their whiskers to solve the task with passive 
whisker stimulation. However, consistent with previous studies*, mice tended to 
focus their whisking on the rewarded target locations. Passive stimulation by the 
moving pole thus occurred in some trials (23% of trials; 6 sessions, 5 animals). In 
general, mice whisked to contact the pole, but whisking varied across trials (Fig. 2b, 
c and Supplementary Fig. 1). Whisking strategies also varied across mice. After 
removal of the pole, during an answer period lasting ~1.5 s, licking was rewarded 
with a drop of water in go trials and punished with an extra inter-trial interval of 4 s 
in no-go trials. Licking was ignored in other behavioural epochs. Multiple pole 
positions in go trials were introduced at this early training stage. The range of go 
positions was initially relatively small (4.5 mm) and was gradually increased with 
training. The pole positions were (relative to the C2 follicle at rest) (Supplementary
Fig. 1b): lateral, 6.7-9.1 mm. Go positions, centre of anterior-posterior range,
0.35-3.32 mm; anterior-posterior range, 5.6-7.0 mm; no-go positions, anterior-posterior
location, 8.9-13.6 mm; lateral, 6.7-9.1 mm. Although all pole locations
were within reach of the long posterior whiskers (C1, C2), mice did not always
touch the most anterior (no-go) location® (Supplementary Fig. 1e).

Once the animals were trained (d' > 1), all whiskers apart from C row were 
trimmed on the right side of the face. Subsequent behavioural sessions were 
performed while imaging under the two-photon microscope. 
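
The training criterion d' is not defined in this section. As a minimal sketch, assuming the standard signal-detection definition d' = z(hit rate) − z(false-alarm rate), it could be computed from trial counts as follows. The function name and the clipping of extreme rates are illustrative assumptions, not details taken from the paper.

```python
from statistics import NormalDist


def d_prime(hits, misses, false_alarms, correct_rejections):
    """Sensitivity index d' = z(hit rate) - z(false-alarm rate)."""
    n_go = hits + misses
    n_nogo = false_alarms + correct_rejections
    # Keep rates away from 0 and 1 so the inverse normal CDF stays finite
    # (a common convention; an assumption, not taken from the paper).
    hit_rate = min(max(hits / n_go, 0.5 / n_go), 1 - 0.5 / n_go)
    fa_rate = min(max(false_alarms / n_nogo, 0.5 / n_nogo), 1 - 0.5 / n_nogo)
    z = NormalDist().inv_cdf
    return z(hit_rate) - z(fa_rate)
```

Under this definition, a session with 84% hits and 16% false alarms gives a d' of roughly 2, comfortably above the d' > 1 criterion.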

Imaging. Imaging was performed 14-23 days after virus injection with a custom- 
made microscope (design available at http://research.janelia.org/Svoboda/).
GCaMP3 was excited using a Ti:sapphire laser (Mai Tai, Spectra Physics) tuned
to λ = 925 nm. We used GaAsP photomultiplier tubes (10770PB-40, Hamamatsu)
and a ×16 (0.8 NA) microscope objective (Nikon). The field of view was
40 × 40 µm (128 × 256 pixels) (Fig. 1f), imaged at 16 Hz. The microscope was
controlled with ScanImage” (http://www.scanimage.org). The average power for
imaging was <50 mW, measured at the entrance pupil of the objective. For each
mouse the optical axis was adjusted to be perpendicular to the imaging window.
Imaging was stopped during inter-trial intervals (3 s; 7 s after false-alarm trials).
Bleaching of GCaMP3 was negligible. Slow drifts of the field of view were corrected 
manually approximately every 50 trials using a reference image. In some cases 
we imaged the same axons over several days (Supplementary Fig. 6). Vascular 
landmarks and visual comparison with reference images from previous days were 
used to identify the same axons. 

Calcium imaging has some drawbacks. Presently used genetically encoded 
indicators are not sufficiently sensitive to detect single action potentials, and as a 
consequence axons with low firing rates were probably missed*” (Supplementary 
Fig. 2). This implies that the representations in vM1→vS1 axons are probably even
richer than suggested by our imaging data. In addition, the slow dynamics 
(hundreds of milliseconds) of the calcium indicator limits the temporal resolution 
of the neurophysiological measurements. Advances in the development of fluor- 
escent sensors of neuronal function will probably ameliorate these problems. 
Image analysis. Frames were registered using rigid translation based on cross- 
correlation. First, we selected a trial with little movement and the frames within 
the trial were registered and averaged. All of the frames from a session were 
registered to this averaged image. To extract fluorescence signals, ROIs were 
drawn over fluorescent varicosities identified by using the mean, maximum 
intensity and standard deviation values of all trials. The pixels in each ROI were 
averaged to estimate fluorescence corresponding to a single varicosity. The 
baseline fluorescence of the ROI, F0, was estimated as the 30th percentile of the
fluorescence using a 32 s sliding window and used to calculate ΔF/F0 = (F − F0)/F0.
To produce an event vector from the ΔF/F0 trace, and thereby minimize the
temporal distortions caused by GCaMP3 dynamics'’, we used a non-negative
deconvolution method*’. These event vectors were used to train the decoder and
to measure average activity when selecting for active axons. Active axons were
defined as having an average event rate higher than 0.007 events per frame and a
peak ΔF/F0 in the peristimulus time histograms higher than 30% for any trial type
(that is, hit, correct rejection or false alarm). This ensures that the calcium signals
are not due to motion artefacts (less than 1% of the GCaMP3 signals are expected to
be polluted by movement) (Supplementary Fig. 3).
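
The baseline estimate and the activity criterion described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the window is taken to be centred on each frame and truncated at the edges of the trace, the percentile uses linear interpolation, and all function names are hypothetical.

```python
def percentile(values, q):
    """q-th percentile (0-100), linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)


def delta_f_over_f(trace, frame_rate_hz=16.0, window_s=32.0, q=30.0):
    """dF/F0 with F0 taken as the q-th percentile in a sliding window."""
    half = int(window_s * frame_rate_hz / 2)  # 256 frames per side at 16 Hz
    dff = []
    for k, f in enumerate(trace):
        # Assumed: centred window, truncated at the start and end of the trace.
        window = trace[max(0, k - half):k + half + 1]
        f0 = percentile(window, q)
        dff.append((f - f0) / f0)
    return dff


def is_active(mean_events_per_frame, peak_dff_by_trial_type):
    """Both criteria from the text: event rate > 0.007 per frame and
    peak dF/F0 > 30% for at least one trial type."""
    return (mean_events_per_frame > 0.007
            and max(peak_dff_by_trial_type.values()) > 0.30)
```

Using a low percentile rather than the mean keeps sparse transients from inflating the baseline.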

Whisker tracking. Whiskers were illuminated using a high-power light-emitting 
diode (940 nm, Roithner) and condenser optics (Thorlabs). Images were acquired 
through a telecentric lens (×0.36, Edmund Optics) by a high-speed CMOS camera
(EoSense CL, Mikrotron) running at 500 frames per second (640 × 352 pixels;
42 pixels per mm). Image acquisition was controlled by Streampix 3 (Norpix,
Canada). The whisker position (θ) and whisker shape were tracked using automated
whisker tracking*’* (https://openwiki.janelia.org/wiki/display/MyersLab/Whisker+Tracking).
Whiskers are cantilevered beams, with one end embedded
into the follicle in the whisker pad. Measurement of whisker shape and the
mechanical properties of the whisker can be used to estimate the forces acting
on the follicle'’. The amplitudes of the forces in the follicle are proportional to the
curvature change of the whisker, Δκ. We used Δκ at a particular location along
the whisker (2-3 mm) as a substitute for the mechanical forces acting on the
whiskers'*"°. Curvature was measured from a parametric curve comprising
second-order polynomials fitted to the whisker backbone. Periods of contact 
between the whisker and object (touch) were detected on the basis of nearest 
distance between whisker and object. 
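
For a parametric curve built from second-order polynomials, curvature has a closed form. The sketch below assumes the whisker backbone is described by x(s) = a0 + a1·s + a2·s² and y(s) = b0 + b1·s + b2·s² and evaluates the standard signed-curvature formula κ = (x'y'' − y'x'') / (x'² + y'²)^(3/2). The coefficient layout and function names are assumptions made for illustration; the fitting step itself is not shown.

```python
def curvature(ax, ay, s):
    """Signed curvature at parameter s of the curve (x(s), y(s)).

    ax and ay hold the quadratic coefficients (c0, c1, c2) of x(s) and y(s).
    """
    dx = ax[1] + 2 * ax[2] * s   # x'(s)
    dy = ay[1] + 2 * ay[2] * s   # y'(s)
    ddx = 2 * ax[2]              # x''(s)
    ddy = 2 * ay[2]              # y''(s)
    return (dx * ddy - dy * ddx) / (dx * dx + dy * dy) ** 1.5


def curvature_change(fit_touch, fit_rest, s):
    """Curvature change between a frame during touch and a reference frame.

    Each fit is an (ax, ay) pair; using a resting frame as the reference is an
    assumption of this sketch.
    """
    return curvature(*fit_touch, s) - curvature(*fit_rest, s)
```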

Behavioural features. Licking was detected using a lickometer’. Lick rate (Hz) was
the inverse of the inter-lick interval. Motor cortex neurons primarily code for slow
variables related to whisking®***. We decomposed whisking (that is, θ at base) into
whisking set-point and whisking amplitude. Whisking set-point was defined as the
0.6 s moving average of θ. Whisker amplitude was defined as the Hilbert transform**
of the absolute value of the band-pass filtered (6-30 Hz) θ. Whisking set-point
and whisker amplitude were highly correlated in our behavioural task
(Supplementary Fig. 5a) and only one of these features (whisking amplitude) is
shown in most figures (Figs 2 and 3). To characterize the quality of the decomposition
we reconstructed whisker position from set-point, whisker amplitude and
whisking phase. The reconstruction error (root mean squared error) across all
animals was 4.5 ± 0.18° (mean ± s.e.m.; range 3.5-6.1°), corresponding to 30% of
the variance of whisker position. The analysis of curvature was restricted to periods
of touch. We further derived features related to protraction touch (negative Δκ)
and retraction touch (positive Δκ) and, separately, their absolute values.
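
Two of the simpler features above can be sketched directly: lick rate as the inverse of the inter-lick interval, and set-point as a 0.6 s moving average of θ. This is only an illustration; it assumes θ is sampled at the 500 frames per second of the whisker camera, that the averaging window is centred and shortened at the edges, and the function names are hypothetical. The band-pass and Hilbert steps for whisking amplitude are not shown.

```python
def lick_rates(lick_times_s):
    """Instantaneous lick rate (Hz) for each pair of consecutive licks."""
    return [1.0 / (later - earlier)
            for earlier, later in zip(lick_times_s, lick_times_s[1:])]


def set_point(theta, sample_rate_hz=500.0, window_s=0.6):
    """Moving average of whisker angle theta over roughly window_s seconds."""
    half = round(window_s * sample_rate_hz) // 2  # 150 samples per side
    smoothed = []
    for k in range(len(theta)):
        # Assumed: centred window, truncated at the start and end of the trace.
        segment = theta[max(0, k - half):k + half + 1]
        smoothed.append(sum(segment) / len(segment))
    return smoothed
```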

All behavioural features were down-sampled to the image-acquisition rate 
(16 Hz). Mean and maximum values were calculated for each feature in a 64-ms 
window centred on the middle of the new sampling point. Mice whisk at 
frequencies above 8 Hz (the Nyquist frequency)” and this information is lost by
down-sampling. Across animals, 48 ± 1% of the energy of the whisking trajectory
was contained at frequencies below 8 Hz. The higher frequency signals (that is,
phase) are not thought to be coded by most of the motor cortex neurons®*. Also,
our imaging methods would be too slow to capture modulation with whisking phase. 
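
The down-sampling step can be sketched as below: each 16 Hz imaging frame receives the mean and maximum of the behavioural signal in a 64 ms window centred on the middle of that frame. The sketch assumes a 500 Hz behavioural signal and a particular rounding of window edges; both, along with the function name, are illustrative choices rather than details from the paper. With 16 Hz sampling the Nyquist frequency is 16/2 = 8 Hz, as stated in the text.

```python
def downsample(signal, in_rate_hz=500.0, out_rate_hz=16.0, window_s=0.064):
    """Return (means, maxima) of `signal`, one value per output frame."""
    half = window_s / 2.0
    n_out = int(len(signal) / in_rate_hz * out_rate_hz)
    means, maxima = [], []
    for k in range(n_out):
        centre = (k + 0.5) / out_rate_hz  # middle of the k-th frame, in seconds
        lo = max(0, int(round((centre - half) * in_rate_hz)))
        hi = min(len(signal), int(round((centre + half) * in_rate_hz)))
        segment = signal[lo:hi]
        means.append(sum(segment) / len(segment))
        maxima.append(max(segment))
    return means, maxima
```

Keeping the maximum alongside the mean preserves brief events, such as short touches, that averaging alone would dilute.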
Correlation analysis to determine if multiple varicosities are part of the same 
axon. vM1 axons form elaborate arborizations in L1 of the vS1, and multiple 
varicosities in one field of view were often part of the same axon. To characterize 
the diversity of signals impinging on vS1, we wanted to report activity in distinct 
axons, as opposed to distinct varicosities. We used correlation-based methods to 
distinguish varicosities that were part of the same axon and those that were part of 
different axons (Fig. 1h-j).

We first selected varicosities that, on the basis of structural images, were part of 
the same axon (14 sessions). We computed their correlation coefficient over an 
entire session, and compared them to correlation coefficients computed over all 
pairs of varicosities (dominated by pairs from different axons). As expected”’**, 
pairs of varicosities from the same axon showed markedly higher correlation 
coefficients compared with random pairs. The correlation coefficient increases 
with increasing event rates. For varicosities from the same axon we plotted the 
correlation coefficients as a function of event rates and fitted an exponential curve 
to the data (Fig. 1j). The 95% confidence value of the fit was used as a criterion to 
assign pairs of varicosities to the same axon, even in the cases where the axonal 
morphology could not be discerned. 

To build clusters of correlated ROIs we selected all the pairs that were considered
to be from the same axon on the basis of correlations (Fig. 1h-j). We seeded a
cluster with one randomly selected pair. The next randomly selected pair could
share one of the ROIs with the existing cluster, in which case it joined the cluster;
otherwise it seeded a second cluster, and so on. We iterated this procedure until all
pairs were assigned. Each cluster was represented in the data set by a ‘representative’
ROI, defined by the largest mean ΔF/F0.
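
The pair-merging procedure can be sketched with a union-find structure. Note one assumption that goes beyond the text: when a pair shares ROIs with two existing clusters, this sketch merges them, which the description does not spell out. ROI identifiers, the `mean_dff` mapping and the function name are hypothetical.

```python
def cluster_rois(pairs, mean_dff):
    """Group ROIs linked by same-axon pairs.

    Returns {representative ROI: sorted members}, where the representative is
    the member with the largest mean dF/F0.
    """
    parent = {}

    def find(roi):
        parent.setdefault(roi, roi)
        while parent[roi] != roi:
            parent[roi] = parent[parent[roi]]  # path halving
            roi = parent[roi]
        return roi

    for a, b in pairs:
        parent[find(a)] = find(b)  # join the clusters containing a and b

    clusters = {}
    for roi in list(parent):
        clusters.setdefault(find(roi), []).append(roi)

    return {max(members, key=lambda r: mean_dff[r]): sorted(members)
            for members in clusters.values()}
```

ROIs that appear in no pair are not returned here; in a full analysis they would be treated as single-varicosity axons.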

Characterizing movement using mice with GFP-expressing axons. Axons and 
their varicosities are tiny structures (one micrometer diameter, or less”). 
Movement of the animal could displace axons in and out of the objective focal 
plane and thus produce motion artefacts in the fluorescence signal. To quantify 
these errors we performed control experiments in three mice with vM1→vS1
axons labelled with green fluorescent protein (GFP). All experimental conditions
were identical, except that rAAV-SYN-GFP (serotype 2/1) was injected into vM1
instead of the GCaMP3-expressing virus. These mice were trained in the object- 
localization behaviour and analysed identically to mice expressing GCaMP3 
(Supplementary Fig. 3). Although varicosities in GFP animals showed move- 
ment-induced changes in fluorescence, mainly during the licking period, these 
changes rarely exceeded 25%. 

Characterization of axonal varicosities in brain slices. Male C57BL/6 mice 
(P15) were injected in the left vM1 with 20 nl AAV-SYN-GCaMP3, serotype 2/1
as described*. Two weeks later vS1 slices were prepared. Mice were anaesthetized
with an intraperitoneal injection of a ketamine/xylazine mixture (0.13 mg ketamine
and 0.01 mg xylazine per gram of body weight) and perfused through the
heart with ice-cold artificial cerebrospinal fluid (ACSF; ~5 ml) containing
127 mM NaCl, 25 mM NaHCO3, 25 mM D-glucose, 2.5 mM KCl, 1 mM MgCl2,
2 mM CaCl2 and 1.25 mM NaH2PO4, aerated with 95% O2 and 5% CO2. The brain
was removed and placed into ice-cold cutting solution containing 110 mM choline
chloride, 25 mM NaHCO3, 25 mM D-glucose, 11.6 mM sodium ascorbate, 7 mM
MgCl2, 3.1 mM sodium pyruvate, 2.5 mM KCl, 1.25 mM NaH2PO4 and 0.5 mM
CaCl2. Coronal slices (300 µm thick) of the right barrel cortex were cut with a
vibrating slicer (Microm, Walldorf) and incubated in oxygenated ACSF for 45 min
at 37°C and then at room temperature (approximately 23°C). For recording,
slices were placed in a chamber with circulating ACSF at 34°C under a custom-built
two-photon microscope. A 40 × 40 µm area of L1 in the barrel cortex was
imaged at 16 Hz using a Ti:sapphire laser tuned to λ = 925 nm. L1 axons were
stimulated with an extracellular electrode at 83 Hz. Fluorescence traces were 
extracted as described for the in vivo experiments. 

Decoding behavioural variables. The relationship between fluorescence signal x_i
of the ith axon and the jth behavioural variable y_j can be characterized as an
encoding description P(x_i | y_j) or a decoding description P(y_j | x_i). The encoding
description specifies how much of the activity of the axon can be accounted for by 
the behavioural variables. The decoding description specifies how well a beha- 
vioural variable can be derived from the activity of populations of axons (Fig. 3a 
and Supplementary Fig. 4) or single axons (Figs 3b-e and 4i, j). Here, we focused 
on the decoding description. 

We used machine learning algorithms to decode the behavioural variables based 
on activity. For single axon decoding (Fig. 3b-e) each axon was used to predict 
behavioural variables, such as forces on each whisker, whisking amplitude and lick 
rate. The input to the algorithm was the event rate (that is, deconvolved ΔF/F0) of a
given axon as well as time-shifted versions. To predict sensory input we used present
and future activity. For motor variables we used both past and future values, as neural
activity could reflect motor commands, corollary discharges or re-afferent input. 

The goal of the decoder algorithm was to find a mapping
$\hat{y}_j(t_k) = f[x_i(t_{k-l}), \ldots, x_i(t_k), \ldots, x_i(t_{k+p})]$ that best approximates $y_j(t_k)$ for all $t_k$.
$t_k$ is discretized time (in units of 1/16 s, corresponding to the imaging rate); $l$ and $p$
represent the maximum negative and positive shifts of the activity, respectively.

We concatenated trials to generate a vector t of time-binned data. For sensory
variables we used l = 5 and p = 0 and for sensory-motor variables l = 5 and p = 5
(corresponding to time-shifts up to 0.32 s). The dimensionality of the input variables
is l + p + 1.
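
The time-shifted input to the decoder can be sketched as a sliding window over one axon's event-rate trace: each row holds the l + p + 1 values from t_{k−l} to t_{k+p}. Dropping time points whose window would run past either end of the concatenated trace is an assumption of this sketch, and the function name is hypothetical.

```python
def lagged_features(x, l, p):
    """Rows [x(t_{k-l}), ..., x(t_k), ..., x(t_{k+p})] for every k at which
    all shifts exist. Each row has l + p + 1 entries."""
    return [x[k - l:k + p + 1] for k in range(l, len(x) - p)]
```

For a population decoder, the rows for each axon would be joined side by side, giving N_axons × (l + p + 1) inputs per time point.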

For decoding neural populations (Fig. 3a and Supplementary Fig. 4a) we used 
for each session all axons showing at least one event and created an input vector of 
size N_axons × (l + p + 1). Each behavioural session, corresponding to one imaged
field of view, was treated separately. 

The algorithm was trained on a subset of trials (the training set, 80%) and
evaluated on a separate set of test trials (20%). We repeated this procedure five
times to obtain a prediction for all trials”.
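
The 80%/20% procedure repeated five times amounts to five-fold cross-validation over trials, so that every trial is predicted exactly once by a model that never saw it. A minimal sketch of the trial bookkeeping follows; the random shuffling, the seed and the function name are assumptions for illustration.

```python
import random


def five_fold_splits(n_trials, n_folds=5, seed=0):
    """Yield (train_trials, test_trials) index lists, one pair per fold."""
    order = list(range(n_trials))
    random.Random(seed).shuffle(order)  # assumed: trials assigned at random
    folds = [order[i::n_folds] for i in range(n_folds)]
    for i, test in enumerate(folds):
        train = [t for j, fold in enumerate(folds) if j != i for t in fold]
        yield sorted(train), sorted(test)
```

Splitting by trial rather than by time bin keeps neighbouring, strongly correlated frames from the same trial out of both the training and the test set at once.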

The accuracy of the decoding algorithm was evaluated using the Pearson correlation
coefficient (ρ) between the model estimate and the data. The explained
variance is R² = ρ² (range, 0-1). R² was calculated separately for each trial type
(that is, hit, correct rejection, miss and false alarm). Treating trial types separately
was critical to disambiguate the relationship between different behavioural vari- 
ables and activity of single axons. For instance, we observed large-amplitude 
whisking during licking, which complicates the classification of the response type 
of the axon. However, during correct-rejection trials, licking was absent and 
whisking present, simplifying the classification of axon types. Similarly, in trained 
animals, touch and licking occurred with short latencies in hit trials. By contrast, in 
false-alarm trials touch was typically absent or diminished in magnitude. 
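
The per-trial-type accuracy measure can be sketched as follows: compute the Pearson correlation ρ between prediction and data within each trial type, then report R² = ρ². The function names and the flat list layout (one trial-type label per time bin) are assumptions made for illustration.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((u - mean_a) * (v - mean_b) for u, v in zip(a, b))
    var_a = sum((u - mean_a) ** 2 for u in a)
    var_b = sum((v - mean_b) ** 2 for v in b)
    return cov / sqrt(var_a * var_b)


def r2_by_trial_type(predicted, measured, trial_types):
    """R^2 = rho^2, evaluated separately for each trial-type label."""
    result = {}
    for trial_type in sorted(set(trial_types)):
        idx = [k for k, t in enumerate(trial_types) if t == trial_type]
        rho = pearson([predicted[k] for k in idx], [measured[k] for k in idx])
        result[trial_type] = rho ** 2
    return result
```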

Decoding was performed with Random Forests”, a multivariate nonparametric machine
learning algorithm based on bootstrap aggregation (that is, bagging) of regression
trees. We used the TreeBagger class implemented in Matlab. TreeBagger requires
only a few parameters: the number of trees (N_trees = 32), the minimum leaf size
(minleaf = 10) and the number of features chosen randomly at each split
(N_split = N_features/3; the typical value used by default*’). These parameters were
chosen as a trade-off between decoder accuracy and computation time. We did not
observe much improvement in decoding accuracy for N_trees > 32 and minleaf
< 10 (data not shown).

Classification of response types. We measured the R² between each measured
behavioural variable (that is, whisking amplitude, lick rate, whisking set-point, etc.)
and each axon’s decoder prediction for all the trials and for each trial type (Fig. 3b-e).
In addition, we used ANOVA to determine whether calcium responses were
different for different pole locations (Fig. 4). We grouped the behavioural variables 
in larger categories such as whisking (that is, including whisking amplitude and 
whisker set-point), lick rate and touch (that is, touch per whisker, rate of change of 
forces, absolute magnitude, etc.). We considered the best R² set for each of the three
behavioural categories. Alternatively, all axons were classified manually on the
basis of trial-to-trial calcium transients and behavioural prediction for each trial
type. For most axons classification was unambiguous based on the decoder R²
values. The remaining axons were more accurately classified based on a rarer trial 
type. For example, touch-related axons were much more active in trials with 
touches compared to trials without touches (Supplementary Fig. 4c). Lick-related 
axons, but not touch-related axons, were active during licking in false-alarm trials. 
Three of the authors independently arrived at consistent classifications. 

Clustering of response types. We used clustering algorithms to display the trial- 
averaged responses of all active unique axons (Fig. 3b). First, we divided the axons into 
five categories: touch, whisking, licking, mixed and unclassified. Within each category 
we computed a similarity index between pairs of axons based on the trial-averaged 
responses in hit, correct-rejection and false-alarm trials. For each axon we concate- 
nated their trial-averaged response for these three trial types. The similarity index was 
the pairwise correlation between the concatenated trial-average responses between 
every axonal pair. We computed a weighted correlation to take into account that the 
number of false-alarm trials is typically lower than either hit or correct-rejection trials. 

Given the two vectors y and x and the weight vector w (set by the relative number of
trials in each trial type):

Weighted mean: $\mu(x; w) = \dfrac{\sum_i w_i x_i}{\sum_i w_i}$

Weighted covariance: $\mathrm{cov}(x, y; w) = \dfrac{\sum_i w_i \,(x_i - \mu(x; w))\,(y_i - \mu(y; w))}{\sum_i w_i}$

Weighted correlation: $\mathrm{corr}(x, y; w) = \dfrac{\mathrm{cov}(x, y; w)}{\sqrt{\mathrm{cov}(x, x; w)\,\mathrm{cov}(y, y; w)}}$

Axons were reordered using hierarchical clustering for each category separately
(using linkage and dendrogram from Matlab with unweighted average distances
between clusters).
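
The weighted mean, covariance and correlation used as the similarity index translate directly into code. The sketch below assumes x and y are the concatenated trial-averaged responses of two axons and w assigns each time bin the weight of its trial type; the function names are illustrative.

```python
from math import sqrt


def w_mean(x, w):
    """Weighted mean: sum_i w_i x_i / sum_i w_i."""
    return sum(wi * xi for wi, xi in zip(w, x)) / sum(w)


def w_cov(x, y, w):
    """Weighted covariance about the weighted means of x and y."""
    mx, my = w_mean(x, w), w_mean(y, w)
    return sum(wi * (xi - mx) * (yi - my)
               for wi, xi, yi in zip(w, x, y)) / sum(w)


def w_corr(x, y, w):
    """Weighted correlation: cov(x, y; w) / sqrt(cov(x, x; w) cov(y, y; w))."""
    return w_cov(x, y, w) / sqrt(w_cov(x, x, w) * w_cov(y, y, w))
```

With equal weights this reduces to the ordinary Pearson correlation; down-weighting the bins from false-alarm trials keeps a noisy average over few trials from dominating the similarity.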
Analysis of sustained activity. For all cells showing object-location-dependent
activity we tested for parametric persistent activity. Fluorescence traces were
deconvolved with an exponential decay characterized by the GCaMP3 decay-time
constant (τ1/2 = 0.45 s) (compare Fig. 4d, f and Supplementary Fig. 8c, d). The
mean value of the deconvolved traces at different times after contact (0.2-1.1,
1.1-2.0, 2.0-2.9 and 2.9-3.8 s after the last contact) was tested for
object-location-dependent activity (two-way mixed-effect ANOVA). To determine whether
object-location-dependent motor behaviours could explain object-location- 
dependent activity, similar analyses were performed for lick rate and whisking 
parameters (Supplementary Fig. 8f, g). 
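Deconvolution with a single exponential kernel has a simple closed form, sketched here in Python. The sketch assumes that the 0.45 s value is used as the e-folding time of the kernel (if it is a half-life, the time constant would be 0.45 / ln 2 s) and that the trace is sampled at a fixed interval; the sampling interval, function names and example events are hypothetical and not taken from the paper.

```python
import math

def convolve_exponential(events, dt, tau):
    """Forward model: each event adds to a level that decays as exp(-t / tau)."""
    decay = math.exp(-dt / tau)
    trace, level = [], 0.0
    for event in events:
        level = level * decay + event
        trace.append(level)
    return trace

def deconvolve_exponential(trace, dt, tau):
    """Inverse filter: remove the decayed remainder of the previous sample."""
    decay = math.exp(-dt / tau)
    events = [trace[0]]
    for previous, current in zip(trace, trace[1:]):
        events.append(current - decay * previous)
    return events

tau = 0.45      # seconds, assumed to be the e-folding time of the kernel
dt = 1.0 / 15   # seconds, hypothetical frame interval
true_events = [0.0, 1.0, 0.0, 0.0, 2.0, 0.0]
fluorescence = convolve_exponential(true_events, dt, tau)
recovered = deconvolve_exponential(fluorescence, dt, tau)
```

On noiseless data the inverse filter recovers the events exactly; real traces would need noise handling that this sketch leaves out.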

We applied linear and nonlinear decoders (Random Forests, linear and 
quadratic discriminant analysis; naive Bayes was also tested and gave identical 
results to linear discriminant analysis) to determine the time course of the 
object-location memory traces and compared their performance 
(Supplementary Fig. 8h). Each decoder was trained to predict the pole position 
as a function of time on the basis of the average activity (that is, events) in the 
previous 0.5 s for each axon. We performed 200 repetitions of this procedure to 
compute the average performance of each decoder and their confidence inter- 
vals. We ensured that there was an equal number of trials per each of the four 
possible pole positions (that is, chance level corresponded to 0.25). As before, we 
used 80% of the trials to train the decoders and tested the performance in the 
remaining 20% trials (repeating this five times to obtain a prediction for the full 
set of trials). The linear and quadratic discriminant decoders assume that the
input is a multivariate Gaussian and find the optimal linear or quadratic curve
that best separates the classes*’.
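The evaluation scheme described above (equal trial counts per pole position, 80% of trials for training, 20% for testing, repeated five times so that every trial receives a prediction) can be sketched as follows. A nearest-centroid classifier is used purely as a dependency-free stand-in for the decoders named in the text; the data, function names and random seed are hypothetical.

```python
import random
from collections import defaultdict

def balance_classes(labels, rng):
    """Subsample trial indices so that every class has the same number of trials."""
    by_class = defaultdict(list)
    for i, label in enumerate(labels):
        by_class[label].append(i)
    n = min(len(idx) for idx in by_class.values())
    keep = []
    for idx in by_class.values():
        keep.extend(rng.sample(idx, n))
    return keep

def fit_centroids(features, labels, train):
    """Stand-in decoder: mean feature vector per class."""
    grouped = defaultdict(list)
    for i in train:
        grouped[labels[i]].append(features[i])
    return {label: [sum(col) / len(vecs) for col in zip(*vecs)]
            for label, vecs in grouped.items()}

def predict(centroids, vec):
    """Assign the class whose centroid is closest in squared Euclidean distance."""
    return min(centroids,
               key=lambda label: sum((c - v) ** 2 for c, v in zip(centroids[label], vec)))

def cross_validated_accuracy(features, labels, rng, n_folds=5):
    """Train on four folds, test on the fifth, and rotate through all folds."""
    idx = balance_classes(labels, rng)
    rng.shuffle(idx)
    folds = [idx[k::n_folds] for k in range(n_folds)]
    correct = 0
    for k, test in enumerate(folds):
        train = [i for j, fold in enumerate(folds) if j != k for i in fold]
        centroids = fit_centroids(features, labels, train)
        correct += sum(predict(centroids, features[i]) == labels[i] for i in test)
    return correct / len(idx)

# Hypothetical, well-separated data for four pole positions with unequal trial counts.
rng = random.Random(0)
labels = [pos for pos in range(4) for _ in range(20 + 5 * pos)]
features = [[10.0 * pos + rng.uniform(-1, 1), rng.uniform(-1, 1)] for pos in labels]
accuracy = cross_validated_accuracy(features, labels, rng)
```

Repeating the whole procedure with fresh random subsamples (200 times in the text) would give the average performance and its confidence interval; with four balanced classes, chance level is 0.25.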

CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature11372 


Corrigendum: Recurrent network activity drives striatal synaptogenesis


Yevgenia Kozorovitskiy, Arpiar Saunders, Caroline A. Johnson, 
Bradford B. Lowell & Bernardo L. Sabatini 


Nature 485, 646-650 (2012); doi:10.1038/nature11052 


On page 3 of the PDF and print version of this Letter, column two, in 
the text describing the mEPSC frequency and dendritic spine density 
values shown in Fig. 3b and 3c, the words ‘saline’ and ‘cno’ were 
erroneously exchanged. The correct text should read: (D1-Cre, direct 
pathway MSNs: frequency, saline 0.8 ± 0.1 Hz, n = 18; cno
0.36 ± 0.07 Hz, n = 25; spine density, saline 0.96 ± 0.04 spines per
µm, n = 5; cno 0.49 ± 0.05 spines per µm, n = 5; D2-Cre, indirect
pathway MSNs: frequency, saline 0.04 ± 0.01 Hz, n = 7; cno,
0.27 ± 0.09 Hz, n = 8; spine density, saline 0.54 ± 0.05 spines per
µm, n = 5; cno 0.98 ± 0.06 spines per µm, n = 5). This has been
corrected online in the PDF and HTML of the original paper. 


CORRIGENDUM 
doi:10.1038/nature11373 


Corrigendum: Biodiversity loss and its impact on humanity


Bradley J. Cardinale, J. Emmett Duffy, Andrew Gonzalez,
David U. Hooper, Charles Perrings, Patrick Venail,
Anita Narwani, Georgina M. Mace, David Tilman,
David A. Wardle, Ann P. Kinzig, Gretchen C. Daily,
Michel Loreau, James B. Grace, Anne Larigauderie,
Diane S. Srivastava & Shahid Naeem


Nature 486, 59-67 (2012); doi:10.1038/nature11148 


In Table 1 and Supplementary Table 2 of this Review, under the 
‘Category of service’ called ‘Regulating’, the first two ‘Measures of 
service provision’ related to ‘Biocontrol’ should read ‘Abundance of
herbivorous pests’ instead of ‘Control of herbivorous pests’. With this
word change, a downward arrow for either the predicted or actual
diversity-service relationship would indicate that the abundance of
herbivorous pests declines (and biocontrol increases) with increasing 
plant diversity. This does not alter any of our conclusions, because all 
diversity-service relationships were correctly described in the text of 
the manuscript itself. These errors have been corrected online in the 
HTML and PDF versions of the original Review, and in the original 
Supplementary Information. 

Observations of increased tropical rainfall preceded by air passage over forests


D. V. Spracklen¹, S. R. Arnold¹ & C. M. Taylor²


Vegetation affects precipitation patterns by mediating moisture, 
energy and trace-gas fluxes between the surface and atmosphere’. 
When forests are replaced by pasture or crops, evapotranspiration 
of moisture from soil and vegetation is often diminished, leading 
to reduced atmospheric humidity and potentially suppressing 
precipitation”*. Climate models predict that large-scale tropical 
deforestation causes reduced regional precipitation* '°, although 
the magnitude of the effect is model*" and resolution® dependent. 
In contrast, observational studies have linked deforestation to 
increased precipitation locally’ * but have been unable to explore 
the impact of large-scale deforestation. Here we use satellite 
remote-sensing data of tropical precipitation and vegetation, 
combined with simulated atmospheric transport patterns, to assess 
the pan-tropical effect of forests on tropical rainfall. We find that 
for more than 60 per cent of the tropical land surface (latitudes 30 
degrees south to 30 degrees north), air that has passed over extens- 
ive vegetation in the preceding few days produces at least twice 
as much rain as air that has passed over little vegetation. We 
demonstrate that this empirical correlation is consistent with 
evapotranspiration maintaining atmospheric moisture in air that
passes over extensive vegetation. We combine these empirical relationships
with current trends of Amazonian deforestation to
estimate reductions of 12 and 21 per cent in wet-season and dry- 
season precipitation respectively across the Amazon basin by 2050, 
due to less-efficient moisture recycling. Our observation-based 
results complement similar estimates from climate models*"°, in 
which the physical mechanisms and feedbacks at work could be 
explored in more detail. 

To explore the links between vegetation and rainfall, we analysed 
combined satellite data on precipitation from the Tropical Rainfall 
Measuring Mission (TRMM) and other satellites’? (TRMM3B42) 
and data on leaf area index (LAI) from the Moderate Resolution 
Imaging Spectroradiometer'® (MODIS). In these data positive spatial 
correlations exist between annual mean precipitation and annual 
mean LAI (Fig. 1a, d; Pearson correlation coefficient, r = 0.81), high- 
lighting the role of precipitation in controlling large-scale vegetation 
patterns. In this study, our aim was to investigate a causal effect of 
vegetation on tropical (30° S to 30° N) rainfall in subsequent days on a 
regional scale (over distances of hundreds to thousands of kilometres). 
To do this, we calculated the origin and atmospheric transport of air 
masses to determine the prior exposure of air to vegetation. This
allowed us to evaluate whether air that had passed over more vegetation
produced more rainfall.

Figure 1 | Annual (2001-2007) mean vegetation, precipitation and evaporation. a, Leaf area index (LAI) from MODIS. b, Example 10-d back-trajectories
arriving daily during 2001. Boxes illustrate the four domains analysed in detail in this study. c, Mean cumulative exposure of back-trajectories to LAI over the
preceding 10 d. d, Precipitation reported by TRMM and other satellites (TRMM3B42). e, Evapotranspiration computed as the mean of the four GLDAS models.

To calculate air-mass histories, we used a Lagrangian atmospheric 
transport model. Such back-trajectory methods have been used previ- 
ously to identify the transport of atmospheric moisture to continental 
regions'””*. We calculated atmospheric back-trajectories arriving daily 
at the centre of all continental 1° × 1° grid squares over the tropical
domain for 2001-2007. The trajectories were calculated using operational
analysis data from the European Centre for Medium-Range
Weather Forecasts (ECMWF), and are hence consistent with the
large-scale atmospheric flow from the assimilated observations. We 
tested the sensitivity of our analysis to the length, arrival height and 
arrival time of the back-trajectories and found consistent results across 
a broad range of choices (Supplementary Figs 1, 2 and 3). Here we 
present results for ten-day back-trajectories arriving at the surface at 
12:00 UT. Figure 1b shows example back-trajectories arriving at four
tropical locations. 

For each individual trajectory, we calculated the cumulative LAI
(ΣLAI) encountered by the air mass during the last ten days of atmospheric
transport. The resulting climatology of ΣLAI (Fig. 1c) is very
similar to the in situ LAI, although important differences are apparent.
For example, there are regions surrounding the Amazon and Congo
basins where in situ LAI is relatively low but ΣLAI is high owing to
exposure of air to large amounts of vegetation as it travels across
forested regions upwind.
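The accumulation of LAI along a back-trajectory can be illustrated with a short Python sketch. It assumes that ΣLAI is the sum of the gridded LAI at each trajectory position multiplied by the time step, which yields the units m² m⁻² d used in the text; the exact integration used by the authors is not stated here, and the grid, the 6-h step and the function names are hypothetical.

```python
import math

def make_lai_lookup(lai_grid):
    """lai_grid maps the south-west corner (lat, lon) of each 1-degree cell to its LAI.
    Cells missing from the grid (for example, ocean) are treated as LAI = 0."""
    def lookup(lat, lon):
        return lai_grid.get((math.floor(lat), math.floor(lon)), 0.0)
    return lookup

def cumulative_lai(trajectory, lai_lookup, step_days=0.25):
    """Sum LAI along the trajectory, weighting each position by the time step in days."""
    return sum(lai_lookup(lat, lon) * step_days for lat, lon in trajectory)

# Hypothetical grid: one forest cell (LAI = 4) and one pasture cell (LAI = 2).
lai_grid = {(-5, -60): 4.0, (-15, -45): 2.0}
lookup = make_lai_lookup(lai_grid)

# Hypothetical 10-d trajectory sampled every 6 h (40 positions):
# 20 positions over ocean followed by 20 positions over the forest cell.
trajectory = [(-5.5, -30.5)] * 20 + [(-4.5, -59.5)] * 20
sigma_lai = cumulative_lai(trajectory, lookup)
```

In this toy case the air mass spends 5 d over a cell with LAI = 4, so ΣLAI is 20 m² m⁻² d even though the arrival point itself could have little vegetation.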

We analysed relationships between the daily variability in ΣLAI and
the daily variability in precipitation. Figure 2a shows these relationships
for air masses arriving in a ~1,000 km × 1,000 km region of
Minas Gerais, Brazil, near the Amazon basin. To reduce the influence
of the initial state of the air mass on our analysis, we stratified the data
into that from the wet season and that from the dry season and then
further according to the initial specific humidity of the trajectory, on
the basis of the ECMWF analyses. We found a strong positive and
significant (Student’s t-test, P < 0.01) relationship between the exposure
of air masses to vegetation and the precipitation those air masses
produce (Fig. 2b). For air masses with low to medium exposure to
precedent vegetation (ΣLAI < 10 m² m⁻² d), mean dry- and wet-season
rainfall increases by 0.25 mm d⁻¹ and 0.4 mm d⁻¹, respectively,
for every additional unit of exposure to LAI that is encountered in the
preceding 10 d. The impact of vegetation exposure tends to saturate,
especially during the wet season, with less sensitivity of rainfall where
ΣLAI > 10 m² m⁻² d. Although we find that the initial specific humidity
of the air mass does affect precipitation, with moister air masses
typically producing more precipitation, the positive trend between
precipitation and exposure to LAI is similar for all subsets of the data.
Qualitatively similar results are found in a 1,000 km × 1,000 km region
south of the tropical forests of the Congo basin (Fig. 2b). Over the
moist tropical forests themselves, differences in rainfall between air
masses with low and high exposures to LAI are notably smaller
(Fig. 2b), although for the Congo basin the differences are significant
in the dry season (P < 0.01). We note that the positive relationships
found between precipitation and ΣLAI are not due to the length of
time the trajectories have spent over land, with distance travelled and
ΣLAI often poorly correlated (Supplementary Table 1), or to the average
topographic height the trajectory has crossed. When we repeated
our analysis using the distance travelled by the trajectory over land or
the average elevation of the topography crossed by the trajectory (in
place of ΣLAI), the relationships were substantially weaker and in
some cases negative (Supplementary Figs 4 and 5).

We extended this regional analysis to explore relationships between 
precipitation and vegetation across the tropics (Fig. 2c). For more than 
60% of the tropical land surface, precipitation is a factor of at least two 
greater in air masses that have been exposed to extensive vegetation in 
the preceding days (defined as the top decile of ΣLAI), relative to air
masses that have been exposed to little (defined as the bottom decile of 
ΣLAI). Significant (P < 0.01) positive correlations between precipitation
and ΣLAI are common features for much of the year in areas
surrounding the Amazon (southern Brazil and Paraguay) and Congo
(southern and eastern Africa) forests, matching where previous studies
have found large continental precipitation recycling ratios'*. We find
few tropical regions with significant negative correlations, although the
relationships between vegetation and precipitation are typically
weaker in moist tropical forests. The weaker signal at the centre of
extensive forests is probably due to the lack of variability in air-mass
exposure to vegetation (Supplementary Fig. 6); however, saturation of
the MODIS LAI retrievals for dense tropical forest canopies’”? may
have a role.

Figure 2 | Relationships between daily precipitation and cumulative
exposure of 10-d back-trajectories to vegetation LAI (ΣLAI) for 2001-2007.
a, Plot for air masses arriving in Minas Gerais, Brazil (10-20° S, 40-50° W).
Data binned into deciles of ΣLAI and stratified by initial specific humidity (q).
Lines show fit to data (solid, wet season; dotted, dry season) and error bars
indicate estimation of error in precipitation (Methods Summary).
b, Comparison of daily precipitation for air masses that have been exposed to
small and large amounts of vegetation (significant (P < 0.01) differences
indicated by squares at top of panel) during atmospheric transport to the
Amazon basin (AB; 10-0° S, 60-70° W), Minas Gerais (MG), the Congo basin
(CB; 5° N-5° S, 15-25° E) and south of Congo (ZA; 10-20° S, 20-30° E) (mean,
star; median, line; 25th and 75th percentiles, box; 5th and 95th percentiles,
whiskers). c, Number of calendar months with significant (P < 0.01; red,
positive; blue, negative) relationships between precipitation and ΣLAI.
Stippling denotes regions where precipitation is a factor of at least two greater in
air with large exposure to vegetation than in air with small exposure. Green
contour delimits areas with >3 m² m⁻² annual mean LAI. Black boxes mark
the four regions in b.

This analysis demonstrates that there are strong positive relation- 
ships between the cumulative exposure of air to vegetation and the 
amount of precipitation that air will produce, suggestive of a water- 
cycle feedback. To explore potential mechanisms underlying these 
relationships, we evaluated the change in atmospheric moisture that 
occurs along our back-trajectories. To do this we calculated the net 
change in specific humidity (Δq) that occurred during continental
transport in the ECMWF humidity analyses along the 10-d back- 
trajectories (Fig. 3a). In general, air becomes drier during atmospheric 
transport over land, owing to lower continental evaporation rates as 
compared with the oceans. Figure 3a demonstrates that air masses that 
have been exposed to more vegetation remain significantly moister 
(P< 0.01), and in some cases air can actually moisten when crossing 
densely vegetated regions. Analysis of the latter cases indicates that 
typically 70-90% of increases in q occur during the hours of daylight, 
consistent with a dominant forcing from evapotranspiration
(Supplementary Fig. 7).

Figure 3 | Atmospheric water-budget components along back-trajectories.
a, Same as Fig. 2b, but for net change in atmospheric specific humidity (Δq) as a
function of ΣLAI. b, Same as Fig. 2b, but for cumulative surface evaporation
(ΣET) as a function of ΣLAI. c, Same as Fig. 2a, but for precipitation as a
function of ΣET.

To quantify the contribution of evapotranspiration to the air-mass 
water budget, we used output from four global land surface models 
that use the best available estimates of meteorological forcing (those 
from the Global Land Data Assimilation System’? (GLDAS)) (Fig. 1e).
Figure 3b shows significant (P < 0.01) positive relationships between
the multi-model mean cumulative surface evaporation (ΣET) and
ΣLAI in all four regions examined. Figure 3c shows rainfall as a function
of ΣET for the Minas Gerais region, demonstrating that additional
moisture from evapotranspiration emitted into air masses with large
exposure to vegetation is substantially greater than the additional precipitation
observed in these air masses. Indeed, for all four regions the
extra ΣET emitted into air masses with large vegetation exposure
exceeds the observed additional precipitation by a factor of at least 
four (Supplementary Table 2). 

Our analysis explores the role of regional-scale vegetation patterns 
on precipitation. Through evapotranspiration, forests maintain atmo- 
spheric moisture that can return to land as rainfall downwind. These 
processes operate on timescales of days over distances of 100-1,000 km 
(ref. 18) such that large-scale land-use change may alter precipitation 
hundreds to thousands of kilometres from the region of vegetation 
change. Land-use patterns and small-scale deforestation may also alter 
precipitation locally, through changes in the thermodynamic profile 
and the development of surface-induced mesoscale circulations”. 
Natural and pyrogenic emissions from vegetation can also have a role 
in rainfall initiation over tropical forest regions”. The impact of cloud 
microphysical processes on precipitation is highly uncertain, and 
biogenic emissions could contribute to our observed relationship 
between rainfall and exposed vegetation. However, our water-balance 
calculations imply that cumulative increases in evapotranspiration 
over upstream forested regions more than account for the increase 
in downstream rainfall. 

Rapid land-use change is occurring across large regions of the 
tropics: 40% of the Amazon is predicted to be deforested by 2050 under 
a business-as-usual scenario”. We used this scenario to explore the 
potential sensitivity of rainfall to changes in moisture recycling as a 
result of deforestation. We combined the deforestation scenario with 
present-day LAI to produce a new spatial distribution of LAI 
(Supplementary Fig. 8) and then used our trajectories to calculate
ΣLAI under the deforested scenario. We then applied our empirical
relationships between ΣLAI and rainfall (Fig. 2b) to estimate
the change in rainfall that might occur as a result of this extensive
deforestation. In this calculation, we implicitly assumed there to be
no change in the large-scale circulation and that the local effects of
deforestation on rainfall were negligible downstream. Despite these
assumptions, our estimates are broadly consistent with estimates of
basin-wide deforestation from climate models*"'°. Using this method,
we estimated a 12% reduction in wet-season precipitation and a 21%
reduction in dry-season precipitation across the Amazon basin
(Fig. 4). This sensitivity is not restricted to the region of deforestation,
and we estimate a 4% decrease in annual total precipitation for the Rio
de la Plata basin. Through comparison with TRMM3B42 data, we
calculate that the estimated reduction in precipitation is equivalent
to the basin-wide drought experienced across the Amazon in 2010.
Such a reduction in precipitation may have consequences for the
future of remaining Amazonian forests*” and for rainfall-reliant
industries both within and outside the Amazon basin, including
agriculture and hydroelectric power generation, which contribute substantially
to South American economies. The successful efforts to curb
Amazon deforestation that have been applied in recent years** must be
maintained if large-scale clearance of the Amazon and the resulting
impacts on regional rainfall are to be avoided.

Figure 4 | Simulated percentage change in precipitation due to 2000-2050
business-as-usual deforestation of the Amazon basin. a, Wet season; b, dry
season. Stippling denotes regions where the simulated precipitation anomaly
differs from the present-day (1998-2010) rainfall by more than 1 s.d. The
Amazon (black) and Rio de la Plata (red) basins are marked.


METHODS SUMMARY 

Remote-sensed data. We used precipitation retrievals from the 3B42 3-h 0.25°
× 0.25° product of TRMM and other satellites!’ to calculate daily accumulated
(24-h) rainfall. We used monthly mean LAI from MODIS'® using the
MOD15_BU_V5 product available at 0.25° × 0.25° resolution. We spatially
averaged both products to 1° × 1° resolution. We apply this temporal and spatial
averaging to the precipitation data to reduce random error”. We estimate the total
error to be the random error, estimated according to ref. 28, plus a systematic error
estimated as 0.2 times the absolute precipitation*®, combined in quadrature.

Land surface models. We used 1° × 1° monthly mean evaporation from four
global land surface models archived on GLDAS”. The models are forced by a
combination of meteorological data sets including atmospheric analysis, and
precipitation from merged gauge-satellite products.

Atmospheric transport. We calculated kinematic atmospheric back-trajectories
arriving four times daily (00:00, 06:00, 12:00 and 18:00 UT) on a 1° × 1° grid for the
period 2001-2007 using the OFFLINE trajectory model. The position of each
trajectory is calculated every 30 min and output every 6 h. We calculated 3-,
5- and 10-d trajectories arriving at the surface and three altitudes above the surface
(corresponding to air pressures of 900, 800 and 700 hPa) that are likely to be within
the deep tropical boundary layer.

Analysis. For each trajectory, we calculated the total distance travelled over land,
the cumulative exposure to LAI (ΣLAI), the average elevation of the topography
over which the trajectory travelled and the cumulative evapotranspiration (ΣET;
specified by GLDAS). We fitted rainfall data with a function of the form
f(x) = a exp(bx) + c (Supplementary Table 2). We calculated ΣLAI regardless of
trajectory pressure. Restricting the calculation to when trajectory pressure is
greater than 850 hPa gives similar results (Supplementary Fig. 9).

METHODS


Remote-sensed data. We used precipitation retrievals (from the Tropical Rainfall
Measuring Mission (TRMM; http://trmm.gsfc.nasa.gov/) and other satellites) in
the 3B42 3-h 0.25° × 0.25° product’* to calculate daily accumulated (24 h) rainfall.
This product reports precipitation from microwave sensors on board polar-orbiting
satellites, combined with more frequent cloud-top temperature data,
surface rain gauge analysis and the precipitation radar on TRMM. We used
the monthly mean leaf area index (LAI) from the Moderate Resolution
Imaging Spectroradiometer (MODIS; http://modis.gsfc.nasa.gov/data/dataprod/dataproducts.php?MOD_NUMBER=15)'®
using the MOD15_BU_V5 product available at 0.25° × 0.25° resolution.

The precipitation product includes both systematic and random error””®.
Systematic biases of 0.5-1 mm d⁻¹, or 0.2 times the mean precipitation, have been
estimated”. To reduce the random error we spatially averaged the precipitation
product to 1° × 1° resolution and temporally averaged to give daily accumulated
(24 h) precipitation. This temporal and spatial averaging applied to TRMM3B42
means that each data point in our analysis (which corresponds to a single trajectory)
is the average of 128 TRMM3B42 data points. This substantially reduces the
random error that is present in the TRMM3B42 product”. We estimate the random
error using the method of ref. 29 and assume a systematic error of 0.2 times the
mean precipitation*’. We combine the random and systematic errors in quadrature
to give an estimate of the total error, which in any case is substantially smaller than
the large effect of vegetation on precipitation. We spatially averaged the LAI data to
the same resolution as the precipitation product.
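The figure of 128 data points and the combination of errors in quadrature can be checked with a small Python sketch. The function names and the example random error and precipitation values are hypothetical; only the grid spacings, the 3-h and 24-h periods and the factor of 0.2 come from the text.

```python
import math

def samples_per_point(native_deg=0.25, target_deg=1.0, native_hours=3, target_hours=24):
    """Number of native retrievals averaged into one 1-degree daily value."""
    per_side = round(target_deg / native_deg)   # 4 native cells along each side
    per_day = target_hours // native_hours      # 8 three-hourly fields per day
    return per_side * per_side * per_day

def total_error(random_error, precipitation, systematic_fraction=0.2):
    """Random and systematic errors combined in quadrature (same units as the inputs)."""
    systematic = systematic_fraction * abs(precipitation)
    return math.hypot(random_error, systematic)

n_samples = samples_per_point()
# Hypothetical example: 0.3 mm per day random error on 2.0 mm per day of rainfall.
example_error = total_error(0.3, 2.0)
```

The count is 4 × 4 native cells times 8 three-hourly fields, which gives the 128 values quoted above; in the example the systematic term (0.4 mm d⁻¹) dominates and the total error is 0.5 mm d⁻¹.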

Land surface models. We used 1° × 1° monthly mean evaporation from four
global land surface models (Community Land Model (CLM), Variable Infiltration
Capacity model (VIC), NOAH and MOSAIC) archived on the Global Land Data
Assimilation System (GLDAS; http://disc.sci.gsfc.nasa.gov/services/grads-gds/gldas)”’.
The models are forced by a combination of meteorological data sets
including atmospheric analysis, and precipitation from merged gauge-satellite 
products. 

Atmospheric transport. We calculated kinematic atmospheric back-trajectories 
arriving four times daily (00:00, 06:00, 12:00 and 18:00 UT) on a 1° × 1° grid for the period
2001-07 using the OFFLINE trajectory model*’. The position of each trajectory is 
calculated every 30 min, and output every 6h. We calculated 3-, 5- and 10-day 
trajectories arriving at the surface and 3 levels above the surface (900, 800 and 
700 hPa) that are likely to be within the deep tropical boundary layer. We demon- 
strate that our results are robust to the back-trajectory length (Supplementary 
Fig. 1), arrival pressure (Supplementary Fig. 2) and arrival time (Supplementary 
Fig. 3) of the trajectory. Throughout the Letter we report analysis of 10-day back- 
trajectories arriving at the surface at 12:00 UT. 

Analysis. For each trajectory we calculated (1) total distance travelled over land
(Σdist), (2) cumulative exposure to LAI (ΣLAI), (3) average elevation of the
topography (specified by the Climate Research Unit CRU CL 2.0 database, 10′
resolution) over which the air mass travels, and (4) cumulative evapotranspiration
(specified by GLDAS; ΣET). We calculate ΣLAI regardless of trajectory pressure
but show that restricting the calculation to when trajectory pressure is greater than
850 hPa gives similar results (Supplementary Fig. 9).


We analysed relationships between daily precipitation and the variables calculated
from air mass history (Σdist, ΣLAI and ΣET). We stratified trajectories into
dry season and wet season and according to the initial specific humidity (q) of the
back-trajectory (taken from the ECMWF analyses). The timings of wet and dry
season are location dependent. For the purpose of this analysis we defined the dry
season at any location as calendar months with below annual average precipitation
for that location and the wet season as calendar months with above average
precipitation (as observed by TRMM3B42; Supplementary Table 2). We demonstrated
that this stratification results in large variability in initial q of the
back-trajectory (Supplementary Table 2). We conducted detailed analysis over 4 large
(10° × 10°, ~1,000 km × 1,000 km) domains. Our pan-tropical analysis was conducted
at a horizontal resolution of 2.5° × 2.5°. For our 7-year analysis each 10°
× 10° grid box represents 255,000 trajectories whereas each 2.5° × 2.5° grid box
represents 15,968 trajectories.
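The location-dependent season definition given above amounts to comparing each calendar month with the annual average, as in this Python sketch. The monthly climatology and the function name are hypothetical, and the handling of a month that exactly equals the average (left unassigned here) is an assumption, since the text does not specify it.

```python
def split_seasons(monthly_precip):
    """Split calendar months (1-12) into dry and wet seasons for one location.

    monthly_precip holds twelve mean monthly precipitation values in any consistent unit.
    Months below the annual average are dry; months above it are wet.
    """
    annual_mean = sum(monthly_precip) / len(monthly_precip)
    dry = [m for m, p in enumerate(monthly_precip, start=1) if p < annual_mean]
    wet = [m for m, p in enumerate(monthly_precip, start=1) if p > annual_mean]
    return dry, wet

# Hypothetical climatology (mm per month) for a Southern Hemisphere tropical site.
climatology = [250, 230, 200, 120, 60, 20, 10, 15, 40, 100, 160, 220]
dry_months, wet_months = split_seasons(climatology)
```

For this example the annual average is 118.75 mm per month, so May to October form the dry season and November to April the wet season.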

We binned trajectories into deciles of ΣLAI. We then compared back-trajectories
with small exposure to vegetation (lowest decile of ΣLAI) to back-trajectories
with large exposure to vegetation (highest decile of ΣLAI) and used the
Student’s t-test to determine the significance of any differences. Accounting for the
stratification described above means that each data point (for example, data point
in Fig. 2a, box-plot in Fig. 2b) represents 3,650 trajectories. We fitted relationships
between precipitation and ΣLAI with functions of the form y(x) = a exp(bx) + c
(see Fig. 2a). Supplementary Table 2 gives the fitted variables for the 10° × 10°
domains. Using a linear fit (y(x) = ax + c), which does not capture the nonlinear
behaviour of the data well, altered our estimated impacts of deforestation on
annual mean Amazon basin rainfall from −14% to −10%.
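The decile comparison described above can be sketched in Python using only the standard library. The sketch computes the pooled-variance (Student) t statistic and its degrees of freedom; converting this to a P value needs the t distribution, which is left out. The synthetic data and function names are hypothetical.

```python
import math
from statistics import mean, variance

def decile_bins(pairs):
    """pairs: (sigma_lai, precipitation) per trajectory.
    Returns ten lists of precipitation values, ordered from lowest to highest sigma_lai."""
    ordered = sorted(pairs)
    n = len(ordered)
    return [[p for _, p in ordered[k * n // 10:(k + 1) * n // 10]] for k in range(10)]

def student_t(sample_a, sample_b):
    """Two-sample t statistic with pooled variance, plus degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    pooled = ((na - 1) * variance(sample_a) + (nb - 1) * variance(sample_b)) / (na + nb - 2)
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, na + nb - 2

# Synthetic example: precipitation rises with exposure to vegetation.
pairs = [(float(i), 2.0 * i) for i in range(100)]
bins = decile_bins(pairs)
t_stat, dof = student_t(bins[9], bins[0])  # highest decile versus lowest decile
```

A positive t statistic here means that trajectories in the highest ΣLAI decile carried more rainfall than those in the lowest decile.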

To estimate the impact of Amazonian deforestation on precipitation we 
combined the functions we fitted above with projected LAI distributions after 
deforestation. We created an LAI distribution for the year 2050 (see 
Supplementary Fig. 8) by combining a business-as-usual deforestation scenario” 
with the present-day LAI distribution from MODIS. We assumed that deforested 
areas are maintained as pasture with a LAI of 1 m² m⁻² (ref. 32). We then ran our
present-day trajectories over the projected LAI to calculate the ΣLAI that would
occur in the deforestation scenario. We used our empirical relationships (calculated
at a resolution of 2.5° × 2.5°) along with ΣLAI to estimate the rainfall that
would occur after deforestation. We estimated both wet season and dry season 
rainfall. Our approach only estimates the change in rainfall due to changes in water 
recycling. It makes the implicit assumption that there is no change in the large- 
scale circulation, and that the local impacts of deforestation on rainfall are neg- 
ligible downstream. Calculation of the full impacts of deforestation on rainfall 
would require a climate model. 
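The final step, applying a fitted relationship of the form y(x) = a exp(bx) + c to present-day and post-deforestation ΣLAI, can be illustrated as follows. The fit parameters and ΣLAI values are invented for illustration and are not the values in Supplementary Table 2; the function names are likewise hypothetical.

```python
import math

def rainfall(sigma_lai, a, b, c):
    """Fitted relationship y(x) = a * exp(b * x) + c, with x the cumulative LAI exposure."""
    return a * math.exp(b * sigma_lai) + c

def percent_change(present_sigma_lai, deforested_sigma_lai, params):
    """Percentage change in total rainfall when each trajectory's exposure is replaced
    by its value under the deforestation scenario; circulation is held fixed."""
    before = sum(rainfall(x, *params) for x in present_sigma_lai)
    after = sum(rainfall(x, *params) for x in deforested_sigma_lai)
    return 100.0 * (after - before) / before

# Hypothetical saturating fit: 1 mm per day at zero exposure, approaching 5 mm per day.
params = (-4.0, -0.2, 5.0)

# Hypothetical exposure for three trajectories, before and after deforestation.
present = [12.0, 8.0, 15.0]
deforested = [6.0, 5.0, 9.0]
change = percent_change(present, deforested, params)
```

Because the same trajectories are reused with a modified LAI field, a calculation of this kind captures only the change in moisture recycling, in line with the assumptions stated above.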

We compared the estimated changes in rainfall due to deforestation with pre- 
sent-day (1998-2010) rainfall recorded by TRMM3B42. We make comparisons 
for both the wet season and the dry season. 

Distinct contribution of stem and progenitor cells to epidermal maintenance

Benjamin D. Simons** & Cédric Blanpain’?


The skin interfollicular epidermis (IFE) is the first barrier against the external environment and its maintenance is critical 
for survival. Two seemingly opposite theories have been proposed to explain IFE homeostasis. One posits that IFE is 
maintained by long-lived slow-cycling stem cells that give rise to transit-amplifying cell progeny, whereas the other 
suggests that homeostasis is achieved by a single committed progenitor population that balances stochastic fate. Here we 
probe the cellular heterogeneity within the IFE using two different inducible Cre recombinase-oestrogen receptor 
constructs targeting IFE progenitors in mice. Quantitative analysis of clonal fate data and proliferation dynamics 
demonstrate the existence of two distinct proliferative cell compartments arranged in a hierarchy involving 
slow-cycling stem cells and committed progenitor cells. After wounding, only stem cells contribute substantially to 
the repair and long-term regeneration of the tissue, whereas committed progenitor cells make a limited contribution. 


Skin epidermis is comprised of a basal layer of proliferative cells and 
several suprabasal layers of terminally differentiated cells that are 
progressively enucleated, forming squames that are shed from the 
skin surface’. Depending on anatomical location and species, the 
minimum transit time from the basal layer to the cornified layer 
has been estimated at around 1-2 weeks’. In homeostasis, cells lost 
during the course of turnover must be perfectly compensated by cells 
generated in the basal layer. Different theories have been proposed to 
explain how this balance is achieved’. On the basis of morphological 
and proliferation studies, it has been proposed that the IFE is 
organized into discrete ‘epidermal proliferative units’ (EPUs), com- 
prised of slow-cycling stem cells (SCs) together with around 10 
transit-amplifying cell progeny, which undergo terminal differenti- 
ation after a fixed number of cell divisions**. Following the clonal 
marking of IFE cells by retroviruses°” or mutagens’®"’, long-lived 
columns of labelled IFE cells that span the epidermis from the basal 
layer to the top of the cornified layer appear, lending support to the 
concept of EPUs. By contrast, the quantitative analysis of lineage 
tracing data in IFE using a ubiquitous promoter suggests that tissue 
is maintained by a single, equipotent, committed progenitor (CP) cell 
population in which the balance between proliferation and differenti- 
ation follows from seemingly random cell fate decisions'*’*. Although 
this model is attractive in its simplicity, it does not easily explain the 
ability of tissue to respond rapidly to increased cell demand, such as 
during wound healing, and cannot rule out the presence of a quiescent 
or slow-cycling IFE population’. More importantly, whereas the 
contribution of different populations of hair follicle SCs to wound 
healing is well-established’*’®, little is known about the contribution 
of putative IFE SCs or CP cells to the repair of skin epidermis. 

From a quantitative analysis of lineage tracing data using two 
different Cre recombinase-oestrogen receptor (Cre-ER) transgenic 
mice, we demonstrate the existence of two distinct populations of 
epidermal progenitors that form a SC/CP cell hierarchy that 
differentially contribute to the homeostasis and repair of the
epidermis. 


Heterogeneity of epidermal progenitors 


To probe the proliferative heterogeneity within the IFE, we made 
use of two Cre-ER transgenic mice that target IFE progenitors 
(Supplementary Fig. 1a, b). With a high dose of tamoxifen, Cre-ER
under the control of the keratin 14 promoter (K14-Cre-ER) can target 
most cells in the basal layer of tail IFE’. A very low dose of tamoxifen 
(0.2 mg) to K14-Cre-ER/RosaYFP induced yellow fluorescent protein 
(YFP) expression at clonal density (Supplementary Fig. 1c). Although 
Cre-ER under the control of the involucrin promoter (Inv-Cre-ER) 
predominantly targets cells in the suprabasal layers, we have recently 
shown that some basal cells, characterized by K5 but not Inv protein 
expression, are also induced but at a lower frequency than K14-Cre- 
ER (Supplementary Fig. 1d-f)*°. The basal cells marked by these two 
different Cre-ER comprise IFE progenitors, as demonstrated by their 
ability to proliferate, and give rise to differentiated progeny, forming 
columns of labelled cells (Supplementary Fig. 1c, d).

To determine whether the two induction protocols mark the same 
homogeneous pool of progenitors, we first quantified their survival at
different time points after tamoxifen administration. Although many 
of the clones induced by the K14-Cre-ER survived up to 1 year post- 
induction, most of the Inv-Cre-ER targeted clones were progressively 
lost (Fig. 1a-c). Whereas the population of ‘surviving clones’, defined
as clones that retain at least one basal layer cell, drops by around 30% 
for K14 mice from 4 weeks to 6 months after labelling, the clone 
density falls by a factor of 10 in the Inv mice over the same period, 
indicating that IFE is maintained by a heterogeneous pool of progenitors 
with different survival potential. 


Inv-Cre-ER targets committed progenitors 


To gain further insight into the behaviour of these apparently distinct 
pools of progenitors, we quantified the pattern of growth of individual 
clones targeted by the Inv-Cre-ER at eight time points ranging from 
3.5 days to 48 weeks post-induction, a period spanning more than half 
of the average lifetime of the mouse. At 3.5 days, 40% of surviving 
clones had undergone at least one round of cell division (Fig. 2a, b and 
Figure 1 | K14-Cre-ER and Inv-Cre-ER target IFE progenitors with
different survival characteristics. a, b, Immunostaining of K5 and YFP in K14- 
Cre-ER/RosaYFP (a) and Inv-Cre-ER/RosaYFP (b) tail epidermis at 1 week and 
48 weeks post-induction showing the high survival rate of K14-Cre-ER clones. 
Dashed lines represent the basal lamina. Hoechst nuclear staining is represented 
in blue. c, Quantification of surviving clones in K14-Cre-ER/RosaYFP and Inv- 
Cre-ER/RosaYFP tail epidermis at different times following tamoxifen 
administration (n = 327 clones at least, for each time point of K14 and Inv).
Symbols show experimental data and the lines represent the prediction of the 
mathematical modelling. The data have been normalized to the initial clone
density as inferred by the fit to theory. Error bars show s.d. Scale bars, 200 µm.


Supplementary Fig. 2a), and the vast majority of these divisions 
resulted in one basal and one suprabasal cell (Fig. 2c, d). At 4 weeks 
post-induction, although almost all surviving clones had undergone 
at least one cell division, 40% of clones still contained only one basal 
cell (Fig. 2a, e), indicating that most of these divisions led to asym- 
metric fate outcome. However, the other 60% of surviving clones had 
also expanded basally (Fig. 2a, f), indicating that progenitors can also 
symmetrically self-renew. 

This clonal heterogeneity increased progressively over the 48-week 
time course, indicating that progenitors follow divergent fates 
(Supplementary Fig. 2b, c). At the same time, the average size of 
surviving clones (as measured both by their total size and their ‘foot- 
print’ on the basal layer) grew approximately linearly with time 
(Supplementary Fig. 2d), whereas the number of surviving clones 
progressively fell (Fig. 1c). Such behaviour mirrors that reported previ- 
ously’* in studies using a ubiquitous promoter, and indicates that 
Inv-Cre-ER targets the actively cycling CP cell population, in which 
stochastic cell loss through differentiation is perfectly compensated by 
duplication (Fig. 2g). This conclusion is reinforced by a quantitative 
analysis of the data, which confirms that the basal layer clone size 
distribution converges onto the hallmark scaling behaviour in which 
the chance of finding a clone larger than some multiple of the average 


258 | NATURE | VOL 489 | 13 SEPTEMBER 2012 


[Figure 2 graphic not reproduced. Recoverable labels: panels a, b, c; x axis ‘Time post-labelling (weeks)’; y axis ‘Clone fraction (%)’; legend ‘Committed progenitor’ and ‘Differentiated cell’; fate probabilities 80%, 10%, 10%; ‘Division time: 1 per week’; x axis ‘Clone size (cells)’.]


Figure 2 | Inv-Cre-ER targets IFE CP cells. a, b, Distribution of Inv-Cre-ER 
clone sizes as measured by basal (a) and total (b) cell content of surviving 
clones, imaged by confocal microscopy on whole-mount tail epidermis from 
3.5 days to 48 weeks following tamoxifen administration. The number of 
analysed clones is indicated in a and this is identical in b, h and i. c, Size
distribution of surviving clones with more than one cell at 3.5 days post- 
labelling disaggregated according to basal and total cell number (n = 37 
clones). d-f, Confocal analysis of representative Inv-Cre-ER-targeted basal 
clones at 3.5 days (d) and 4 weeks (e, f) post tamoxifen. Dashed lines represent 
the basal lamina. Hoechst nuclear staining is represented in blue. g, A fit of the 
Inv-Cre-ER clonal fate data predicts that progenitors undergo population 
asymmetry through a combination of asymmetric cell division, symmetrical 
self-renewal and symmetrical differentiation with an average cell cycle time of 
approximately 1 week. h, i, Frequency distribution of basal (h) and total 

(i) clone size. Symbols represent experimental data and the lines correspond to 
the model prediction. Error bars show s.e.m. Scale bars, 10 µm.


stays constant over time (Supplementary Fig. 2e and Supplementary 
Methods, section “Theory’)*’. From the detailed study of the basal and 
total clone size distribution over the 48 week time course (Fig. 2h, i, 
Supplementary Fig. 2f and Supplementary Methods, section “Theory’), 
we are able to conclude that, following CP cell division (at an average 
rate of 1.21 ± 0.04 per week), 80% result in asymmetric fate (leading to
one dividing and one differentiated cell) with the remainder leading to 
symmetric duplication or differentiation with approximately equal 
probability (Fig. 2g), consistent with the results from ref. 12. Indeed, 
from the 48 week time point, we are able to resolve a small (less than 1 
in 14) bias towards terminal division. 
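
The committed progenitor dynamics described above can be illustrated with a minimal stochastic sketch. It uses the division rate (1.21 per week) and the fate probabilities (80% asymmetric, 10% symmetric duplication, 10% symmetric differentiation) quoted in the text. Everything else is an assumption made for illustration: exponentially distributed waiting times between divisions, exactly balanced symmetric fates (the small bias towards terminal division is ignored), differentiated cells leaving the basal layer immediately, and the hypothetical function names. It is not the authors' fitting procedure, and it does not attempt to reproduce the fits shown in the figures.

```python
import random


def simulate_basal_clone(t_end, rate=1.21, r=0.10, rng=random):
    """Basal cell number at t_end (weeks) for a clone founded by one CP cell.

    Asymmetric divisions leave the basal count unchanged, so only symmetric
    divisions are simulated; per cell they occur at rate 2 * r * rate.
    """
    n, t = 1, 0.0
    while n > 0:
        # Waiting time to the next symmetric division anywhere in the clone
        t += rng.expovariate(2.0 * r * rate * n)
        if t > t_end:
            break
        # Duplication and terminal division are assumed equally likely
        n += 1 if rng.random() < 0.5 else -1
    return n


def survival_and_mean_size(t_end, n_clones=20000, seed=0, **kw):
    """Fraction of surviving clones and mean basal size of the survivors."""
    rng = random.Random(seed)
    sizes = [simulate_basal_clone(t_end, rng=rng, **kw) for _ in range(n_clones)]
    survivors = [s for s in sizes if s > 0]
    return len(survivors) / n_clones, sum(survivors) / len(survivors)


def critical_survival(t_end, rate=1.21, r=0.10):
    """Survival probability of a critical birth-death process: 1 / (1 + r * rate * t)."""
    return 1.0 / (1.0 + r * rate * t_end)
```

Under these assumptions the surviving fraction decays as 1 / (1 + rλt) while the mean basal size of surviving clones grows as 1 + rλt, so their product stays constant. This is the qualitative signature described in the text: surviving clones grow roughly linearly in size while their number falls.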


K14-Cre-ER targets long-lived SCs 


To determine whether the dynamics of K14 clones are different from 
the Inv clones, we performed extensive clonal analysis using the K14- 
Cre-ER mice ranging from 3.5 days to 48 weeks post-induction
(Fig. 3). At just 3.5 days, 60% of K14 clones had already undergone 
one round of cell division (Fig. 3a, b and Supplementary Fig. 3a), 
whereas the fraction of clones involving two or more basal cells was 
greatly increased compared with Inv (Fig. 3a-d). However, following
an initial abrupt expansion, the growth of K14 clones rapidly 
decelerates over the first few weeks, with the size distribution from 
8 weeks to 6 months post-induction showing only a modest expansion 
(Fig. 3a-f and Supplementary Fig. 3a-c). Alongside the propensity of
clones to persist, these observations are consistent with the K14 
promoter targeting both the CP cell population and a second, slow- 
cycling, SC population. In particular, the precipitous expansion of 
clones (Fig. 3a-c), which is far more rapid than that found in the
Figure 3 | K14-Cre-ER targets IFE SCs. a, b, Distribution of K14-Cre-ER
clone sizes as measured by basal (a) and total (b) cell content of surviving clones, 
imaged by confocal microscopy on whole-mount tail epidermis from 3.5 days to 
48 weeks following tamoxifen administration. The number of analysed clones is 
indicated in a and this is identical in b, h and i. c, Size distribution of surviving 
clones with more than one cell at 3.5 days post-labelling disaggregated 
according to basal and total cell number (n = 197 clones). d-f, Confocal
analysis of representative K14-Cre-ER targeted basal clones at 3.5 days 

(d), 4 weeks (e) and 48 weeks (f) post tamoxifen. Dashed lines represent the 
basal lamina. Hoechst nuclear staining is represented in blue. Scale bars, 10 µm.
g, Biophysical modelling of the clonal fate data predicts that K14-Cre-ER targets 
both CP cells and SCs with the latter undergoing population asymmetry 
through a combination of asymmetric cell division, symmetrical self-renewal 
and symmetric differentiation with an average cell cycle rate of between 4 and 6 
per year. h, i, Frequency distribution of basal (h) and total (i) clone size. Symbols 
represent data and the lines correspond to the model prediction. j, FACS 
analysis of K5tTA/tet(O)-H2BGFP mice at different time points following DOX
administration. k, Frequency of the number of cell divisions that surviving basal 
IFE cells undergo during the chase period as quantified by FACS analysis at 
different time points after DOX administration to adult K5tTA/tet(O)- 
H2BGFP mice. Symbols represent the average of the experimental data (n = 3 
mice at each time point). Lines correspond to the predicted model dynamics 
(Supplementary Methods, section ‘Theory’). l, m, Confocal analysis of H2B-GFP
immunostaining in unchased mice (no DOX) (left panel) and 6 weeks after
DOX administration (right panel), showing the preferential localization of the
slow-cycling IFE cells in the bottom of epidermal undulations (stars) (l) and of
YFP immunostaining in K14-Cre-ER/RosaYFP 8 weeks after tamoxifen
administration (m). Dashed lines are hair follicles. Scale bars, 20 µm. Error bars
show s.e.m. Hoechst nuclear staining is represented in blue. 


Inv study (Fig. 2a-c), indicates that SCs are induced predominantly on
cell cycle entry. This conclusion is consistent with the observation that, 
after administration of 12-O-tetradecanoyl phorbol-13-acetate (TPA), 
a drug that stimulates epidermal proliferation, there is an 80% increase
in clone labelling after a low dose of tamoxifen (Supplementary Fig. 4a,
b). Furthermore, the deceleration in the rate of clonal expansion at 
subsequent times indicates that, after division, SCs re-enter a 
quiescent phase whereas their CP cell progeny go on to proliferate 
and differentiate. For these clones, the quiescent SC ‘mother’ provides 
an anchor to the basal layer (Fig. 3a-f and Supplementary Fig. 3a), 
leading to clonal persistence. Such behaviour is consistent with the 
observation that, in K14 mice, the level of positive regulators of the 
cell cycle, such as Ccnb1 and Cdc20, decreased, whereas negative regulators
of the cell cycle, such as Cdkn1a and Cdkn2a, increased in basal
cells between 3.5 days and 8 weeks (Supplementary Fig. 4c-e). 

With this hypothesis, if we assume that SC division can also lead to 
all three possible fates (Fig. 3g), the predicted model dynamics pro- 
vides an excellent fit to the clone size data (basal and suprabasal) over 
the wide range of time points up to 1-year post-labelling (Fig. 3h, i, 
Supplementary Fig. 3c, d and Supplementary Methods, section 
‘Theory’). In particular, we find that SCs account for some 50 ± 5%
of the induced proliferative cells. With a SC division rate as low as 
4-6 per year, a factor of 10-20 times slower than CP cells, most of the 
clone size data (from 1 week to 3 months post-labelling) are largely 
fixed by the known characteristics of CP cells. Drawing on the early 
(3.5 day) time point, we are able to deduce that, in common with their 
progenitor cell progeny, SCs undergo population asymmetric self-renewal
with some 80 ± 10% of SC divisions resulting in asymmetric
fate (one SC and one CP cell), whereas the remaining divisions are 
equally balanced between SC duplication and symmetrical differenti- 
ation into two CPs (Fig. 3g). Finally, although the analysis points at a 
proliferative hierarchy, we cannot rule out the possibility that prolif- 
erative cells may switch reversibly between CP and SC behaviour if the 
transfer rate is sufficiently low. Moreover, the data do not allow us to 
infer whether the balance between SC proliferation and differentiation 
follows from intrinsic (cell autonomous) regulation or is controlled by 
environmental cues (Supplementary Methods, section “Theory’)*’. 
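
The two-compartment hierarchy described above can be sketched as a small stochastic simulation. It takes the quantities quoted in the text (SC division at roughly 4-6 per year, here 5 per year; CP division at 1.21 per week; 80% asymmetric and 10% each symmetric fate in both compartments). The remaining choices are assumptions for illustration only: exponentially distributed waiting times, differentiated cells leaving the basal layer immediately, no bias of CP cells towards terminal division, no reversion from CP to SC, and the hypothetical function names. It is not the model fitted in the Supplementary Methods.

```python
import random


def simulate_clone(start, t_end, rng, sc_rate=5 / 52, cp_rate=1.21, r=0.10):
    """Return (stem cells, committed progenitors) in the basal layer at t_end weeks.

    start is "SC" or "CP" and sets the single founder cell of the clone.
    """
    s, c = (1, 0) if start == "SC" else (0, 1)
    t = 0.0
    while s + c > 0:
        sc_total, cp_total = sc_rate * s, cp_rate * c
        # Waiting time to the next division anywhere in the clone
        t += rng.expovariate(sc_total + cp_total)
        if t > t_end:
            break
        u = rng.random()
        if rng.random() * (sc_total + cp_total) < sc_total:
            # A stem cell divides
            if u < r:
                s += 1              # two SCs
            elif u < 2 * r:
                s -= 1              # two CP cells
                c += 2
            else:
                c += 1              # one SC and one CP cell
        else:
            # A committed progenitor divides
            if u < r:
                c += 1              # two CP cells
            elif u < 2 * r:
                c -= 1              # both daughters differentiate
            # otherwise asymmetric: basal CP count is unchanged
    return s, c


def surviving_fraction(start, t_end, n_clones=2000, seed=0):
    """Fraction of clones that retain at least one basal cell at t_end weeks."""
    rng = random.Random(seed)
    alive = sum(
        1 for _ in range(n_clones) if sum(simulate_clone(start, t_end, rng)) > 0
    )
    return alive / n_clones
```

Comparing `surviving_fraction("SC", 48.0)` with `surviving_fraction("CP", 48.0)` shows the qualitative point made in the text: a slow-cycling SC anchors its clone to the basal layer, so SC-derived clones persist far more often than CP-derived clones.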


Proliferation dynamics of the IFE 


To challenge these findings, we applied further independent experi- 
mental assays to explore the IFE cell proliferation using a combination 
of cell cycle analysis, 5-bromo-2-deoxyuridine (BrdU) incorporation, 
and pulse-chase experiments using the K5tTA/tet(O)-H2B-GFP 
reporter mouse, that have proved useful in resolving quantitatively 
the proliferation dynamics of hair follicle bulge SCs'*****. We first 
determined the cell cycle profile of basal IFE cells by determining their 
DNA content. These experiments show that around 9% of 
α6⁺CD34⁻ IFE cells were in the S/G2/M phase of the cell cycle
(Supplementary Fig. 5a, b). Continuous BrdU administration labelled
28 ± 2% and 57 ± 10% of K5-expressing cells after 24 and 72 h,
respectively, indicating an average cell division time of around
6 ± 1 days (Supplementary Fig. 5c, d and Supplementary Methods,
section ‘Theory’), consistent with the inferred CP cell division rate.
To develop a more refined method to quantify proliferation kinetics 
over a longer period, we used K5tTA/tet(O)-H2B-GFP mice (Sup- 
plementary Fig. 6). In the absence of doxycycline (DOX), histone 2B 
domain-green fluorescent protein (H2B-GFP) fusion protein is 
expressed at a very high level in a discrete peak of fluorescence 
(Supplementary Fig. 6a-c, top panels). Following DOX administra- 
tion, H2B-GFP fluorescence decreases by a factor of two after cell 
division’*’* (Supplementary Fig. 6a-c, bottom panels). The discrete 
peak of H2B-GFP fluorescence observed by fluorescence-activated cell 
sorting (FACS) analysis during the chase period allowed an accurate 
quantification of the distribution in the number of rounds of division 
experienced by basal layer cells (α6⁺CD34⁻) at various time points
following induction (Fig. 3j and Supplementary Fig. 6d). At 3 weeks
post-DOX administration, dilution of H2B-GFP label shows evidence 
of CP cell proliferation, while a small peak at high levels of fluorescence 
is consistent with around 5% or less of label-retaining cells (LRCs). 
From the quantification of fluorescence over the time course, the
distribution of cell divisions was shown to be quantitatively consistent 
with the inferred SC and CP cell dynamics (Fig. 3k and Supplementary 
Methods, section ‘Theory’). Notably, after 6 weeks of chase, the H2B-GFP
label has become diluted in the vast majority of basal layer cells
and only a small minority retain high levels of expression (Supplementary
Fig. 6d). Intriguingly, these LRCs are localized to the bottom of the
undulations of the tail epidermis (Fig. 3l), which correlate with the
location of persistent clones derived by the K14 assay (Fig. 3m), and 
are reminiscent of the so-called rete ridges in human epidermis, where 
epidermal SCs were thought to reside*”*. 
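
The label-dilution logic used in this section can be written down in a few lines. The text states that H2B-GFP fluorescence halves at each division after DOX administration, so the number of divisions follows from a base-2 logarithm of the fluorescence ratio. The second helper is an added assumption, not the authors' model: it treats divisions as a Poisson process at a fixed rate, which gives a rough expectation for how many divisions fast- and slow-cycling cells accumulate during the chase. Function names are hypothetical.

```python
import math


def divisions_from_fluorescence(f_initial, f_now):
    """Number of divisions implied by a two-fold dilution of label per division."""
    return math.log2(f_initial / f_now)


def division_distribution(t_weeks, rate_per_week, k_max=8):
    """P(k divisions in t_weeks) for k = 0..k_max, assuming Poisson-distributed divisions."""
    mean = rate_per_week * t_weeks
    return [math.exp(-mean) * mean ** k / math.factorial(k) for k in range(k_max + 1)]


# Example: compare a cell dividing 5 times per year with one dividing 1.21 times per week
slow = division_distribution(3, 5 / 52)
fast = division_distribution(3, 1.21)
```

In this simplified picture, `slow[0]` (the chance of no division after 3 weeks) is much larger than `fast[0]`, which is why a slow-cycling population shows up as a residual peak of label-retaining cells.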


Molecular characterization of SCs and CP cells 


To gain further insight into the molecular properties of these two 
functionally distinct pools of epidermal progenitors, we transcriptionally
profiled FACS-purified α6⁺CD34⁻ K14-Cre-ER and
Inv-Cre-ER YFP-positive IFE cells 3.5 days after tamoxifen admin- 
istration (Supplementary Fig. 7). Comparison of the K14 and Inv 
transcriptome revealed notable differences. All markers previously 
associated with murine and human IFE SCs**’””° were upregulated 
in K14-targeted cells, including α6, α2, α3, β1 integrins and Cspg4
(Supplementary Fig. 8). Real-time quantitative reverse transcriptase- 
PCR (qRT-PCR) performed on independent biological samples con- 
firmed the upregulation of these integrins in K14-targeted cells 
(Fig. 4a). FACS analysis showed that α6β4 integrins were expressed
at a higher and more uniform level in K14- than in Inv-targeted cells.
The pattern of α2β1 integrin showed a more notable difference
between the two populations, with a very high and narrow peak of
α2β1 expression in K14-targeted cells compared to the much lower
and heterogeneous α2β1 expression in Inv-targeted cells (Fig. 4b, c
and Supplementary Fig. 9). Gene ontology classification of the genes 
differentially expressed by more than 1.5-fold between K14- and 
Inv-labelled cells revealed that genes preferentially expressed in 
Figure 4 | Molecular signature of K14 SC and Inv CP cells. a, RT-PCR
analysis of the expression of representative genes upregulated in K14-Cre-ER- 
targeted cells, compared with the level of expression of the same genes in Inv-Cre- 
ER-targeted cells and total basal IFE cells (n = 3). b, c, Fluorescence histogram 
(b) and geometric mean fluorescence (c) of integrin cell-surface expression in all 
basal IFE cells, K14-Cre-ER- and Inv-Cre-ER-targeted IFE cells, as determined by 
FACS analysis, 3.5 days after tamoxifen (n = 3). d, RT-PCR analysis of the 
expression of representative genes upregulated in Inv-Cre-ER-targeted cells, 
compared to the level of expression of the same genes in basal epidermal K14-Cre- 
ER-targeted cells and total suprabasal IFE cells (n = 3). Error bars show s.e.m. 




K14-positive cells are enriched with a very high statistical significance 
in functional groups involved in positive and negative cell cycle regulation
(for example, Ccnd2, Cdc20, Cdkn1a, Cdkn2a), mitosis (for
example, Ccnb1, Kif11), chromosome segregation (for example,
Cenpe), regulation of cell proliferation (for example, Epgn), and
DNA repair (for example, Brca1) (Fig. 4a, Supplementary Figs 8,
10a and Supplementary Table 1). By contrast, Inv-Cre-ER labelled 
cells preferentially expressed genes known to control epidermal differentiation*” *
(for example, Notch3, Grhl3), keratinocyte differentiation
and keratinization (for example, Sprr1a, 1b, 2d, 2i)”*, and lipid
metabolism (for example, Elovl4, 6, 7, Olah), as confirmed by qRT- 
PCR analysis (Fig. 4d, Supplementary Figs 8, 10b and Supplementary 
Table 1). Altogether these data demonstrate that K14- and Inv- 
targeted cells differ by their gene expression profile, and uncover 
many new markers preferentially expressed by IFE SC and CP cells. 

To characterize further the molecular heterogeneity of basal IFE 
cells, we FACS-purified different populations of basal epidermal cells 
(α6⁺CD34⁻) according to the level of β1 integrin expression which,
together with α2 integrin, presents the most significant difference
between K14 and Inv-targeted cells at 3.5 days following tamoxifen
administration (Fig. 4b and Supplementary Fig. 9). We fractionated
epidermis into three distinct populations: one population (<3%)
expressing very high levels of β1 integrin (which are likely to be
enriched in SCs as they are only detected in K14- but not Inv-targeted
cells), a second population expressing intermediate levels of β1 integrin
detected in both K14 and Inv-targeted cells (which are likely to represent
CP cells), and cells expressing low levels of β1 integrin only
found in Inv-targeted cells (which are likely to represent the differentiated
basal cells and early suprabasal cells) (Supplementary Fig. 11a).
Strikingly, many markers identified as being upregulated in K14 versus
Inv were also upregulated in cells expressing very high levels of β1
integrin compared to cells expressing intermediate or low levels
(Supplementary Fig. 11b). Similarly, many genes preferentially
expressed by Inv cells were already upregulated in cells expressing
intermediate levels of β1 integrin compared to β1 integrin high cells
but, nevertheless, much less expressed than in cells expressing low
levels of β1 integrin (Supplementary Fig. 11c). These data demonstrate
that the expression of many differentiation-associated genes is already 
upregulated at the earliest stage of progenitor commitment, remin- 
iscent of the lineage priming reported in haematopoietic stem cells**”’. 


Long-term SC contribution to wound healing 


In addition to their role in maintenance, adult SCs are implicated in the 
repair of damaged tissue following injury. Lineage tracing has shown 
that hair follicle SCs are mobilized to the wound area and contribute to 
the repair of damaged epidermis'*""*. At the same time, genetic mouse 
mutants, that present a complete absence of hair follicle, heal incisional 
wounds with a slight delay of the reepithelialisation, demonstrating 
that IFE cells are also capable of tissue regeneration*®. However, in 
unperturbed conditions, the contribution of IFE SCs and progenitors 
to wound healing is currently unknown. To address the respective 
contributions of K14-SCs and Inv-CPs during tissue repair, we first 
marked basal progenitors by titrating the dose of tamoxifen so as to 
label Inv and K14-Cre-ER epidermis at roughly the same density 
(4.5 ± 0.1 clones per mm² and 3.8 ± 0.5 clones per mm² in K14 and
Inv, respectively), then subjected tail epidermis to full thickness 
excisional wound using punch biopsy, and assessed the contribution 
of marked cells to repair (Fig. 5a). Whole-mount analysis showed 
major recruitment of K14-Cre-ER-labelled IFE cells to the wound area, 
with clones migrating from the periphery towards the centre of the 
wound (with only 7.5% of clones from hair follicle), persisting long-term
after wounding (2.6 ± 0.1 clones per mm² after 35 days) (Fig. 5b, c
and Supplementary Fig. 12), and consisting of very large clones with a 
broad basal attachment and huge numbers of differentiated cells 
(Fig. 5d). In sharp contrast, few Inv-Cre-ER-targeted cells were 
recruited to the wound area, and these clones were, on average, much 
Figure 5 | Massive and sustained contribution of K14 SC during wound
healing. a, Scheme representing the experimental strategy used to label basal 
progenitors and assess their contribution during wound healing. 

b, Microscopic analysis of YFP immunostaining performed on whole-mount of 
wounded tail epidermis 4 weeks after tamoxifen administration to K14-Cre- 
ER/RosaYFP and Inv-Cre-ER/RosaYFP mice and analysed at different times 
following wounding. Dashed lines represent the wounded area. Scale bar, 

200 µm. c, Confocal analysis of whole-mount immunostaining of YFP and β4
integrin 21 days following wounding, showing the departure of K14 YFP-labelled
cells from the IFE. The arrow indicates the wound localization. Scale bars,
20 µm. d-f, Confocal analysis of representative clones from wounded tail
epidermis 35 days post-wound, showing very big clones containing multiple
basal cells in K14-targeted cells (d), whereas the rare still-visible Inv-marked
clones are either completely detached from the basal epidermis (e) or very small,
containing one or few basal cells (f). Dashed lines represent the basal lamina.
Scale bars, 20 µm. Hoechst nuclear staining is represented in blue.


smaller. At 35 days post-injury, few Inv clones survived (0.3 ± 0.3
clones per mm²) and consisted of either clones of cells being shed with
no basal attachment (Fig. 5e), or small clones with one or few basal cells 
(Fig. 5f). These data demonstrate that K14-derived SC are capable of 
extensive tissue regeneration, whereas Inv CP cells have only a limited 
contribution to wound healing. 


Discussion 


Our study demonstrates the existence, hierarchical organization and the 
proliferation dynamics of two distinct types of progenitors that effect 
different functions during homeostasis and repair of the IFE in adult 
mice. Quantitative modelling of clonal fate data suggest that both the 
slow-cycling SCs and the more rapidly cycling CP cells share a similar 
pattern of population asymmetric self-renewal in which the balance 
between proliferation and differentiation is achieved through stochastic 
fate choice (Supplementary Fig. 13). These findings provide a reconcili- 
ation of seemingly contradictory theories of IFE maintenance*”, and 
explain the proliferative heterogeneity previously reported in IFE**””*. 
The existence of slow-cycling SCs and more rapidly cycling progenitors 
is reminiscent of the situation encountered in other tissues including 
cornea®’, hair follicle*’, blood*!“”, muscle*® and brain“. In all of these 
tissues, these slow-cycling SCs can switch rapidly and reversibly between 
quiescence and activity following injury and/or drug treatment’. 
Indeed, this partitioning of function—progenitors to undertake routine 
maintenance and quiescent SCs to effect repair—may represent a generic 
strategy of tissue maintenance. 

The demonstration that the long-term wound healing potential is 
dominated by the IFE SC compartment may suggest that either SC 
and CP cell populations are not interchangeable, in contrast to pro- 
genitors in intestinal epithelium”, or that their reversion rate is suffi- 
ciently slow to render its effects negligible. The seemingly short-lived 
contribution of Inv CP cells to the repair of the epidermis is remin- 
iscent of the fate of K15 bulge-derived SCs during wounding”, and 
demonstrates the critical role of IFE SCs for the long-term repair of
the epidermis, as has been proposed"®. 


METHODS SUMMARY 


Clonal YFP expression in the tail IFE was induced by administering 0.2 mg and
2.5 mg tamoxifen to K14-Cre-ER/RosaYFP and Inv-Cre-ER/RosaYFP mice,
respectively. Immunostainings were performed as described’. Quantification of 
the proportion of surviving clones as well as the basal, suprabasal and total clone 
size was determined by counting the number of YFP⁺ cells using whole-mount tail
epidermis, analysed by confocal microscopy. BrdU pulse and H2B-GFP label 
retention was performed as described’*“*. The wound healing assay was performed 
by administering 1 mg and 10 mg tamoxifen to K14-Cre-ER/RosaYFP and
Inv-Cre-ER/RosaYFP mice, respectively. Four weeks later, punch biopsies were
performed in the tail epidermis and analysed at different time points. Mathematical
modelling of the clonal fate data was performed as described in Supplementary 
Methods, section ‘Theory’. For further details see Supplementary Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Mice. K14-Cre-ER'’ and TRE-mCMV-H2B-GFP” transgenic mice were 
provided by E. Fuchs. K5-tTA mice** were a gift from A. Glick. RosaYFP mice” 
were obtained from Jackson Laboratory. Involucrin-CreERT2 were previously 
described*®. Mouse colonies were maintained in a certified animal facility in 
accordance with European guidelines. These experiments were approved by the 
local ethical committee (CEBEA). 

Targeting YFP expression. For lineage tracing experiments, K14-Cre-ER/RosaYFP
and Involucrin-CreERT2/RosaYFP mice were induced between 2
and 4 months with 0.2 mg and 2.5 mg of tamoxifen (Sigma-Aldrich), respectively, 
by intraperitoneal injection. 

For cell sorting, K14-Cre-ER/RosaYFP and Inv-Cre-ER/RosaYFP mice were 
induced with 1 mg and 10 mg of tamoxifen, respectively, by intraperitoneal injec- 
tion and isolated 4 days later. 

Proliferation experiments. For H2B-GFP chase, K5-Tet off/TRE-mCMV-H2B-GFP
mice were treated once with 200 µl of doxycycline (2 mg ml⁻¹) by intraperitoneal
injection and simultaneously fed continually with doxycycline food (1 g kg⁻¹)
until euthanasia. For BrdU experiments, mice were injected every 12 h with
200 µl of BrdU (10 mg ml⁻¹) and analysed 24 h and 72 h after the first injection.

Wound experiments. After anaesthesia of the mice (5% xylazine 10% ketamine in
PBS), circular pieces of epidermis were removed from K14-Cre-ER/RosaYFP
and Inv-Cre-ER/RosaYFP tail epidermis, 4 weeks after tamoxifen induction,
using a 3 mm diameter biopsy punch (Stiefel).

Histology and immunostaining. Skin epidermis was removed from tail bone 
and pre-fixed overnight in 4% paraformaldehyde at 4°C. Tissues were washed 
three times in PBS for 5 min and incubated overnight in 30% sucrose in PBS at 
4 °C. Tissues were then embedded in OCT and kept at −80 °C. Sections of 6 µm
were cut using a CM3050S Leica cryostat (Leica Microsystems).

Sections were incubated in blocking buffer (1% BSA, 5% horse serum, 0.2% 
Triton in PBS) for 1h at room temperature. Primary antibodies were incubated 
overnight at 4°C or 1h at room temperature in the dark. Sections were rinsed 
three times in PBS and incubated with appropriate secondary antibodies diluted 
to 1:400 and Hoechst in blocking buffer for 1 h at room temperature. Sections
were again washed three times with PBS. The following primary antibodies were 
used: anti-involucrin (rabbit, 1:1,000, Covance), anti-K5 (rabbit, 1:1,000, 
Covance), anti-K10 (rabbit, 1:1,000, Covance), anti-β4 (rat, 1:200, BD
Biosciences), anti-GFP (rabbit, 1:1,000, Molecular Probes), anti-GFP (chicken, 
1:1,000, Abcam). The following secondary antibodies were used: anti-rabbit, 
anti-rat, anti-goat, anti-chicken conjugated to Alexa Fluor 488 (Molecular 
Probes), to rhodamine Red-X (Jackson Immunoresearch) or to Cy5 (Jackson 
Immunoresearch). Nuclei were stained in Hoechst solution (1:2,000) and slides 
were mounted in DAKO mounting medium supplemented with 2.5% Dabco 
(Sigma). 

Epidermal whole-mount. Pieces of skin tail were incubated in EDTA (20 mM) 
on a rocking plate at 37 °C for 1 h. Epidermis was separated from the dermis as an
intact sheet and washed 2 times with PBS. Pieces of epidermis were pre-fixed in 
4% paraformaldehyde overnight at 4 °C or 1 h at room temperature. Epidermis
was rinsed 2 times with PBS for 5 min and stored in PBS with 0.2% azide at
4 °C. Small pieces of epidermis were incubated in blocking buffer (1% BSA, 5% 
horse serum, 0.8% Triton in PBS) for 3h at room temperature on a rocking plate 
(100 r.p.m.). The samples were incubated in primary antibodies overnight at 
room temperature on the rocking plate. The primary antibodies used were the 
following: anti-K5 (rabbit, 1:200, Covance), anti-β4 (rat, 1:100, BD Biosciences), 
anti-GFP (rabbit, 1:200, Molecular Probes), anti-GFP (chicken, 1:200, Abcam). 
Samples were then washed 3 times in PBS with 0.2% Tween for 1 h and incubated 
in appropriate secondary antibodies diluted 1:400 in blocking buffer overnight at 
4 °C on the rocking plate. Pieces of epidermis were then washed with PBS 3 times 
for 1 h. For BrdU staining, samples were incubated in 1 M HCl at 37 °C for 20 min, 
washed with PBS with 0.2% Tween, stained with anti-BrdU (rat, 1:200, Abcam) in 
blocking buffer and with appropriate secondary antibody after several washes. 
Nuclei were stained in Hoechst solution diluted 1:1,000 for 30 min and mounted 
in DAKO mounting medium supplemented with 2.5% Dabco (Sigma). 

TPA experiment. 12-O-tetradecanoyl-phorbol-13-acetate (TPA) from Sigma 
was diluted in acetone (final concentration 25 mg ml⁻¹). K14-Cre-ER/RosaYFP 
mice were treated topically with TPA on the tail epidermis for 11 days and 
induced with tamoxifen on day 8. On day 12, treated mice were killed and 
processed to obtain whole-mounts of tail epidermis. 

Microscope image acquisition and quantification. All pictures of section 
immunostaining were acquired using the Axio Imager M1 Microscope, the 
AxioCamMR3 or MrC5 camera and using the Axiovision software (Carl Zeiss). 
Acquisitions were performed at room temperature using ×20 numerical aperture 
(NA) 0.4 and ×40 NA 0.75 EC Plan-Neofluar objectives (Carl Zeiss). All confocal 
pictures on whole-mount were acquired at room temperature using a Zeiss 
LSM780 confocal microscope fitted on an Axiovert M200 inverted microscope 
equipped with a ×40 NA 1.2 C-Apochromat water immersion objective (Carl 
Zeiss MicroImaging GmbH). Sequential scans of 1,024 × 1,024 pixels, 
z-stack 0.3 µm, were acquired using the ZEN 2010 software (Carl Zeiss). 
The number of basal clones per interfollicular region was quantified on frozen 
sections using an Axio Imager M1 microscope for each time point and represented 
as the percentage of basal clones. Clone sizes were quantified on whole-mount tail 
epidermis by acquiring individual clones using an LSM 780 confocal microscope, 
and the numbers of basal and suprabasal cells were counted. 
Dissociation of epidermal cells. Skin epidermis (from CD1, K5tTA/tet(o)- 
H2BGFP treated or not with DOX, K14-Cre-ER and Inv-Cre-ER/RosaYFP 
induced with tamoxifen for 3.5 days or 8 weeks) was removed from tail bone 
and incubated overnight in HBSS (Gibco) 0.25% trypsin (Gibco) at 4°C. 
Epidermis was separated from the dermis and incubated on a rocking plate 
(100 r.p.m.) at room temperature for 5 min. Basal cells were mechanically sepa- 
rated from the epidermis by flushing 10 times under the epidermis. Tissues were 
then cut into pieces of 1 mm² with a scalpel and trypsin was neutralized by adding 
DMEM medium (Gibco) supplemented with 2% Chelex fetal calf serum (FCS). 
Samples were filtered through 70- and 40-µm filters (Falcon). 
Cell labelling, flow cytometry and cell sorting. At a concentration of 
20 million cells per ml, cells were incubated in 2% FCS/PBS with primary 
antibodies for 30 min on ice, protected from light, with shaking every 10 min. 
Immunostaining for isolating K14 and involucrin cells was performed using 
biotin-conjugated anti-CD34 (clone RAM34; BD Biosciences) and 
phycoerythrin-conjugated anti-α6 integrin (clone GoH3; BD Biosciences). Primary 
antibodies were washed with 2% FCS/PBS and cells were incubated for 30 min in 
allophycocyanin-conjugated streptavidin (BD Biosciences) secondary antibodies, 
on ice, with shaking every 10 min. Living K14- and involucrin-expressing 
epidermal cells were gated by forward scatter, side scatter, negative staining 
for Hoechst dye and by following the YFP signal. Basal cells from the 
interfollicular epidermis were targeted using CD34-negative α6-integrin-positive 
gating. Immunostaining for isolating basal β1 high, intermediate and low cells 
was performed using biotin-conjugated anti-CD34 revealed with a 
phycoerythrin-Cy7-conjugated streptavidin (BD Biosciences) and 
allophycocyanin-conjugated anti-CD29 (β1 integrin, clone Hmb1-1, eBiosciences). 
Single living, CD34-negative, β1 low/medium/high cells were sorted. β1 low, 
medium and high gates were set up on the basis of its expression in K14-Cre-ER- 
and Inv-Cre-ER/RosaYFP-targeted basal IFE cells. Fluorescence-activated cell 
sorting analysis was performed using FACSAria I at high pressure (70 p.s.i.) and 
FACSDiva software (BD Biosciences). Sorted cells were harvested directly in the 
lysis buffer provided by the RNeasy microkit (QIAGEN) supplemented with 1 µl of 
beta-mercaptoethanol for every 100 µl of lysis buffer. RNA extraction was 
performed on freshly sorted cells according to the manufacturer’s protocol. The 
entire procedure was repeated in at least three biologically independent samples. 
Other integrin antibodies were used to determine their expression in the 
different cell populations: phycoerythrin-conjugated anti-CD49b (α2 integrin, 
clone Hma2, BD Biosciences), allophycocyanin-conjugated anti-CD29, and rat 
anti-CD104 (β4 integrin, clone 346 11A, BD Biosciences) revealed with a 
Rhodamine Red-X-conjugated anti-rat antibody (donkey, Jackson ImmunoResearch). 
Total suprabasal and total basal IFE cells isolation for qPCR controls. For total 
suprabasal IFE control, skin epidermis was removed from tail bone and incubated 
1 h in EDTA (20 mM) on a rocking plate at 37 °C. Epidermis was separated from 
the dermis and basal cells were flushed out from the epidermis. The piece of 
epidermis containing the suprabasal cells was then frozen in liquid nitrogen. 
RNA extraction was performed using RNeasy microkit (QIAGEN): the tissue 
was cut in small pieces and lysed in RLT buffer containing β-mercaptoethanol 
according to the manufacturer’s protocol. For total basal IFE qPCR control, cells 
were obtained by gating the whole α6⁺ (low, medium and high) and CD34⁻ 
population during FACS sorting. 
RNA quality. Extracted RNA quality was tested using capillary electrophoresis 
(Agilent Bioanalyzer, Agilent RNA 6000 Nano Kit). RNA samples that were degraded, 
showed signs of degradation or contained high protein or salt contamination 
were discarded. 
Microarray analysis. Cohorts of 10 Inv-Cre-ER/RosaYFP and K14-Cre-ER/RosaYFP 
mice were used to isolate at least 100,000 basal IFE cells (α6⁺ CD34⁻ 
YFP⁺) per replicate. Total RNAs were isolated from these sorted cells and were 
labelled and hybridized on a mouse genome 430 2.0 array. Microarrays were 
performed in duplicate for the untreated K14-Cre-ER/RosaYFP and tamoxifen- 
treated K14-Cre-ER/RosaYFP and Inv-Cre-ER/RosaYFP sorted cells. All the 
results were normalized using the frozen robust multiarray analysis (fRMA) 
normalization using R-bioconductor package fRMA°*°”' with standard parameters. 
Genetic signatures were obtained by considering genes with a fold change 
greater than 2 or smaller than −2. 
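The fold-change filter described above can be sketched in a few lines of Python. This is only an illustration: it assumes log2-scale expression values (as RMA-type normalization typically produces), the gene names and numbers are invented, and the signed convention used here (a ratio below 1 is reported as −1/ratio) is an assumption about how the ±2 threshold was applied.

```python
def signed_fold_change(log2_test, log2_ref):
    """Signed fold change from two log2 expression values.

    Ratios >= 1 are returned as-is; ratios < 1 are returned as -1/ratio,
    so a two-fold decrease is reported as -2.0.
    """
    ratio = 2.0 ** (log2_test - log2_ref)
    return ratio if ratio >= 1.0 else -1.0 / ratio


def signature(log2_test, log2_ref, threshold=2.0):
    """Return {gene: fold change} for genes beyond +/- threshold."""
    selected = {}
    for gene, value in log2_test.items():
        fc = signed_fold_change(value, log2_ref[gene])
        if fc > threshold or fc < -threshold:
            selected[gene] = fc
    return selected


# Hypothetical log2 expression values (not data from the study).
inv = {"geneA": 9.5, "geneB": 7.0, "geneC": 8.2}
k14 = {"geneA": 8.0, "geneB": 9.0, "geneC": 8.0}
print(signature(inv, k14))
```

In this toy input, geneA and geneB pass the filter while geneC (a change of about 1.15-fold) does not.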
Reverse transcription and quantitative PCR. Cohorts of 10 Inv-Cre-ER/RosaYFP 
and K14-Cre-ER/RosaYFP mice were used to isolate at least 100,000 
basal IFE cells (α6⁺ CD34⁻ YFP⁺) per replicate. Each RNA sample was quantified 
using a NanoDrop spectrophotometer. The quality and quantity of the RNA used for 
qRT-PCR were exactly the same as for the RNA used for microarray analysis. 
Purified RNA (200 ng) was used to synthesize the first-strand cDNA using 
Superscript II (Invitrogen) and random hexamers (Roche). A mock control was 
obtained by following the same procedure without adding Superscript II. 
Quantitative PCR analyses were performed with 1 ng of complementary DNA as 
template, using a Brilliant II Fast SYBR QPCR Master Green mix (Stratagene) and 
an Agilent Technologies Stratagene Mx3500P real-time PCR system. 

All primers were designed using the Assay Design Center (Roche Applied Science, 
https://www.roche-applied-science.com/sis/rtpcr/upl/index.jsp?id=uplct_030000). 
The primers are listed below. The linearity of each primer pair was tested 
with the following amounts of cDNA: 8 ng, 2 ng, 0.5 ng and 
0.125 ng. 
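The text does not state how linearity was scored. A common approach, assumed here purely for illustration, is to regress Ct on log10 of the cDNA input and inspect the slope, R² and the implied amplification efficiency. The dilution amounts below are those given in the text; the Ct values are invented.

```python
import math


def standard_curve(inputs_ng, ct_values):
    """Least-squares fit of Ct against log10(cDNA input).

    Returns (slope, intercept, r_squared, efficiency), where efficiency is
    10 ** (-1 / slope) - 1 (1.0 means perfect doubling per cycle).
    """
    xs = [math.log10(v) for v in inputs_ng]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ct_values) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ct_values))
    syy = sum((y - mean_y) ** 2 for y in ct_values)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = (sxy * sxy) / (sxx * syy)
    efficiency = 10.0 ** (-1.0 / slope) - 1.0
    return slope, intercept, r_squared, efficiency


# Dilution series from the text; the Ct values are made up for illustration.
dilutions_ng = [8.0, 2.0, 0.5, 0.125]
ct = [22.0, 24.0, 26.0, 28.0]  # each 4-fold dilution costs 2 cycles
slope, intercept, r2, eff = standard_curve(dilutions_ng, ct)
print(f"slope={slope:.3f}, R^2={r2:.4f}, efficiency={eff:.2%}")
```

With these idealized values, each four-fold dilution delays the signal by exactly two cycles, which corresponds to a slope near −3.32 and an efficiency of 100%.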

Analysis of results was performed with MxPro software (Stratagene). The 
delta-delta Ct (ΔΔCt) method was used to calculate the expression of the involucrin 
samples relative to the K14 samples, using the housekeeping gene TataBox as reference. 
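The delta-delta Ct calculation mentioned above can be written out as a short Python sketch. It assumes roughly 100% amplification efficiency for both primer pairs (the usual 2^−ΔΔCt form); the Ct values are hypothetical and the function name is illustrative.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_calibrator, ct_ref_calibrator):
    """Relative expression by the delta-delta Ct method (2 ** -ddCt).

    Assumes ~100% amplification efficiency for both primer pairs.
    """
    d_ct_sample = ct_target_sample - ct_ref_sample
    d_ct_calibrator = ct_target_calibrator - ct_ref_calibrator
    dd_ct = d_ct_sample - d_ct_calibrator
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values: a target gene and the housekeeping gene, measured in
# involucrin-derived (sample) and K14-derived (calibrator) cells.
fold = relative_expression(
    ct_target_sample=24.0, ct_ref_sample=20.0,
    ct_target_calibrator=26.0, ct_ref_calibrator=20.0,
)
print(fold)
```

Here the target crosses threshold two cycles earlier in the sample than in the calibrator after normalization, which corresponds to a four-fold higher relative expression.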

Primers. Gene symbol, forward primer and reverse primer (5’ to 3’) are: Agpat3, 
ccatcagcaagcacctatacc and cactccaggagcatgacca; Ccnb1, tgcattttgctccttctcaa and 
caggaagcagggagtcttca; Ccnd2, caccgacaactctgtgaagce and tccacttcagcttacccaaca; 
Cd36, ttgtacctatactgtggctaaatgaga and cttgtgttttgaacatttctgctt; Cd55, 
actgttgattgeeacgatgag and tggtggctctggacaatgta; Cdc20, acatcaaggcgctgtcaag and 
aatgtgccggtcactggt; Cdca5, cacagtgacttatgtaggaactggtc and caccccgttcaccaatgt; 
Cenpe, tctttaccgtctgaggtggaa and ggagctcttcagatttctcataca; Cenpp, 
gcagaatgctgcaaaacg and caaagattcccactcctcaga; Degs1, tcaccttcaatgtgggctatc and 
cacttgcgatcttcctcacc; Dgat2, ggcgctacttccgagactac and tggtcagcaggttgtgtgtc; Dll1, 
gggcttctctggcttcaac and taagagttgccgaggtccac; Dsc1, gggagcaccttctctaagca and 
ttttgacaggcatcacaaaataa; Elovl4, acgacaccgtggagttctatc and gcggccagtctgctacac; 
Elovl6, cagcaaagcacccgaacta and aggagcacagtgatgtggtg; Elovl7, tctcagtcgccaagagcaa 
and acagctcgatgaatttggaga; Epgn, ggctctgggesttctgatag and cctctgcttcttcgctcagt; 
Fa2h, tegtegactggeacaagg and gttggtgaacccactcatca; Fads6, ccacccttatctccatgtcaa 
and cttgtccggggagaacatag; Grhl3, aaggaagatgtcgaatgaacttg and 
tcgtcctcattactgtagggaaa; Il1r2, cccatccctgtgatcatttc and gcacgggactatcagtcttga; 
involucrin, atgtcccatcaacacacactg and atgtcccatcaacacacactg; Itga2, 
gggaccggagectttcta and tgcataatactgatttccacactg; Itga3, aggatatgtggcttggagtga and 
gaccacagcaccttggtgta; Itga6, attcaggagtagcttgetggat and 
cttatttctcttgaagaagccacac; Itgb1, atgcaggttgcggtttgt and catccgtggaaaacaccag; 
Kif11, aaagaggaaaagggcaggaa and ccgctctgccagattaaatg; Kif14, 
ggctatttgcaattgtttttge and tttgagcctctttaaccatcg; Kif2c, cgaaggaggtaccacaaaagg 
and ttcggtcgtaagggaagaag; Lass4, gcctgcatcttgctttctg and ctgccacagccactcactc; 
Lpl, ctggtgegaaatgatgtge and tggacgttgtctagegesta; Notch3, agctggetcctgaggtgat 
and agacagagccggttgtcaat; Nrp1, gaggaatgttctgtcgctatga and ccaatgtgagggccaact; 
Olah, ggccgaaagattaatggctta and gettcttcaaatcggstttct; S100a3, 
gggacacccagttgetage and gcacacgatggcagctact; T-box, gtaccgcagcttcaaaatattgtat 
and aaatcaacgcagttgtgcgtg. 
Ploughing the deep sea floor 
& Antoni M. Calafat² 


Bottom trawling is a non-selective commercial fishing technique 
whereby heavy nets and gear are pulled along the sea floor. The 
direct impact of this technique on fish populations’” and benthic 
communities** has received much attention, but trawling can also 
modify the physical properties of seafloor sediments, water- 
sediment chemical exchanges and sediment fluxes®*®. Most of the 
studies addressing the physical disturbances of trawl gear on the 
seabed have been undertaken in coastal and shelf environments”*, 
however, where the capacity of trawling to modify the seafloor 
morphology coexists with high-energy natural processes driving 
sediment erosion, transport and deposition’. Here we show that 
on upper continental slopes, the reworking of the deep sea floor by 
trawling gradually modifies the shape of the submarine landscape 
over large spatial scales. We found that trawling-induced sediment 
displacement and removal from fishing grounds causes the 
morphology of the deep sea floor to become smoother over time, 
reducing its original complexity as shown by high-resolution 
seafloor relief maps. Our results suggest that in recent decades, 
following the industrialization of fishing fleets, bottom trawling 
has become an important driver of deep seascape evolution. Given 
the global dimension of this type of fishery, we anticipate that the 
morphology of the upper continental slope in many parts of the 
world’s oceans could be altered by intensive bottom trawling, 
producing comparable effects on the deep sea floor to those 
generated by agricultural ploughing on land. 

Continental slopes are at the transition between shallow continental 
shelves and deep basins and are often cut by submarine canyons. 
They represent the steepest part of continental margins, which extend 
from the shelf break, usually at depths of 100-200 m, to the upper 
continental rise at depths of 1,500-3,000m. At fine spatial and 
temporal scales (that is, excluding tectonics), continental-slope 
morphological features form as a result of mechanisms governed by 
erosional and depositional processes’. Slope failures and sediment 
gravity flows'', geostrophic contour-flowing bottom currents’’, 
internal waves’’ and dense shelf water cascading’* are some of the 
main processes shaping continental slopes. 

Bottom trawling is recurrently practised in numerous continental 
slopes of the world’s oceans (Fig. 1). Driven by technical improve- 
ments and the exhaustion of traditional fish stocks, trawling fleets have 
progressively expanded their activities from the continental shelf 
towards deeper regions'*"*, where background energy levels controlling 
natural sedimentary processes are generally low except for infrequent, 
localized events'*"*. 

The northwestern Mediterranean continental slope is a region where 
deep-sea trawling has been practised for a long time. A monospecific 
fishery targeting blue and red deep-sea shrimp Aristeus antennatus 
(Risso, 1816) was active in this area as early as the 1930s (ref. 17). 
However, fishing effort (a measure of the amount of fishing, generally 
defined as the fleet capacity times days at sea) was negligible until the 
mid-1960s, when the trawling fleet became fully industrialized. This 
trend notably accelerated in the 1970s in terms of engine power’® and, 
subsequently, of gear size, fishing depth and trawled area per haul. The 
slope fishing grounds in the northern Catalan margin are currently 
being regularly exploited by 180 large bottom trawlers (that is, each 
over 15 m in overall length) down to 800 m depth, with greater depths 
being reached occasionally (Fig. 2). 

Figure 1 | Map of known trawling grounds on the world continental slopes. Red 
areas illustrate the extent of the trawling and the letters refer to the main 
fishery operating in each area, as detailed in the Supplementary Information. 
Darker blue regions represent water depths exceeding 2,000 m. 

¹Marine Sciences Institute, CSIC, E-08003 Barcelona, Spain. ²CRG Marine Geosciences, University of Barcelona, E-08028 Barcelona, Spain. 

286 | NATURE | VOL 489 | 13 SEPTEMBER 2012 

©2012 Macmillan Publishers Limited. All rights reserved 

Figure 2 | Bathymetric map of the study area. Bathymetric map of the 
northern Catalan margin (northwestern Mediterranean) showing the main 
trawling grounds on the open continental slope and canyon flanks (blue 
hatching). Data was obtained from the VMS. The largest ports of the area 
(vessels in white circles) and the 800-m isobath (white thick contour) are 
highlighted. The location of Fig. 4 and the mooring station (red star in white 
box; see Fig. 3 and Supplementary Fig. 1) in La Fonera (also named Palamos) 
Canyon are indicated. 

Time series observations were carried out on the flank of a trawled 
submarine canyon (La Fonera Canyon, Fig. 2) to measure sediment 
transport induced by fishing activities. Following an earlier study on 
sedimentary processes’’, an instrumented mooring was placed at 
980 m depth within a tributary valley on the canyon flank to capture 
the occurrence of trawl-induced sediment gravity flows. The new 
observations reveal a highly active scenario with the nearly daily 
occurrence of sediment transport events linked to the passage of the 
trawling fleet upslope of the mooring site (Fig. 3). Increases of near- 
bottom suspended sediment concentrations and current velocities from 
trawl-induced sediment gravity flows were observed repeatedly during 
weekdays at working hours, indicating a periodic sediment removal 
from fishing grounds. Such flows reached maximum downslope 
velocities of up to 38 cm s⁻¹ and concentrations of up to 236 mg l⁻¹ 
(Fig. 3; Supplementary Fig. 1a). Maximum instantaneous sediment 
fluxes at 4-6 m above the sea floor (m.a.s.f.) were 34 g m⁻² s⁻¹ and 
accounted for a cumulative transport of about 1.1 tonnes m⁻² during 
the observational period (Supplementary Fig. 1b). This near-bottom 
transport is of the same order of magnitude as the wintertime down- 
slope cumulative transport induced by storms and dense shelf water 
cascading in the nearby Cap de Creus and Lacaze-Duthiers submarine 
canyon heads’. Integrating the sediment transport from 4 to 
50 m.a.s.f., a minimum of 5.4 × 10³ tonnes of sediment is estimated 
to have been exported downslope through the monitored tributary 
valley in 136 days as a consequence of trawling activities (Supplemen- 
tary Fig. 1c). These recurrent trawl-induced low-concentration sedi- 
ment gravity flows only travel a few kilometres downslope from the 
canyon flank to the main canyon axis where they deposit their sediment 
load’’. Consequently, after the industrialization of the fishing fleet, a 
twofold increase of the sediment accumulation rates occurred in the 
canyon axis’®. 

Figure 3 | Time series observations of trawling-induced sediment gravity 
flows. Graphs showing near-bottom suspended sediment concentration (SSC) 
and current speed records observed at the northern flank of La Fonera Canyon 
during a two-week period at different heights above the sea floor (m.a.s.f., 
metres above sea floor). Sediment transport events were observed during 
weekdays (that is, working days for the fishery), but not on weekends. The detail 
of a one-day record (21 June 2011) shows two sharp increases in SSC and the 
consequent intensification of the current speed (directed downslope towards 
192°), with higher velocities closer to the sea floor that denote the occurrence of 
sediment gravity flows. The two peaks correspond to the two main hauls of the 
trawling fleet, the first one heading offshore and the second one heading to port. 
Orange bars on the x axes denote working days and working hours, 
respectively. Note the change in scale of the current-speed y axes. See mooring 
location in Figs 2 and 4 and Supplementary Figs 2 and 3. 

In 2007 a high-resolution multibeam bathymetry survey was conducted 
in La Fonera Canyon (Fig. 4). Analysis of the bathymetry data 
revealed a noticeable smoothing of bottom topography along the 
northern canyon flank at depths shallower than 800 m (Fig. 4a), which 
had been tentatively interpreted as being caused by dense shelf water 
cascading flowing southwards’’. Canyon flank smoothing, however, 
was also observed locally in the southern canyon flank (Fig. 4a), away 
from the region potentially affected by cascading flows, challenging 
previous hypotheses. The fact that the lower limit of the smoothed 
bathymetric range coincided with the maximum trawled depths 
reported by the fishermen operating in the area pointed to trawling 
as a potential seafloor shaping agent within this depth range. 
Plotting four years (2007-2010) of satellite-based navigation tracks 
from all large bottom trawlers operating in the area on top of the 
multibeam bathymetry of La Fonera Canyon revealed that navigation 
tracks coincided with the smoothed canyon flanks at depths shallower 
than 800 m (Fig. 4b) in a region characterized by homogeneous slopes 
(Fig. 4c) and low rugosity (a dimensionless measure of surface roughness) 
values (Supplementary Fig. 2). In contrast, untrawled canyon flank 
segments are dominated by a dense network of tributary valleys that 
progress upslope from the main canyon axis, reaching up to five orders 
of bifurcation*’. Such complex morphology is missing in the trawled 
depth range and slightly below. Here the sea floor becomes smoother 
and only the main branches of the tributary valley networks are 
preserved (Fig. 4, Supplementary Fig. 3). Therefore, a causative 
relationship can be established between trawling-induced sediment 
erosion effects and the reduction of morphological complexity in the 
studied area. Contrasts between untrawled and trawled seafloor 
regions are also evident in sediment cores (Supplementary Fig. 4) and 
remote operated vehicle (ROV) footage from canyon flanks 
(Supplementary Fig. 5). Considering only the satellite-based navigation 
tracks from 2010, we determine that an area of 41.6 km² was 
affected in the northern canyon flank alone. This represents 76.3% 
of the canyon flank area that is shallower than 800 m. 

Figure 4 | Seafloor morphology and bottom trawling in La Fonera Canyon. 
a, Shaded relief image from a 15-m resolution bathymetric grid showing gullied 
and smoothed seafloor regions, the latter at depths shallower than 800 m (white 
contour) especially on the northern flank but also in parts of the southern flank. 
b, Positioning data (small yellow arrows) of bottom trawling vessels fishing 
from 2007 to 2010. c, Slope map illustrating regions of homogeneous values 
between 10° and 30° in the most heavily trawled regions of the canyon flanks 
compared to untrawled regions where the gullied sea floor and the tributary 
valley networks are apparent, displaying higher (>40°) and lower (<10°) 
slopes in the walls and axes of the various valleys, respectively. The positions of 
the mooring station (red star; see Fig. 3 and Supplementary Fig. 1), sediment 
cores (green dots; see Supplementary Fig. 4) and ROV images (green triangles; 
see Supplementary Fig. 5) are also indicated. 

To provide an estimate of the volume of sediment that can be remo- 
bilized downslope by trawling activities, the integrated sediment trans- 
port during the observational period (Supplementary Fig. 1c) was 
converted to volume using a mean dry bulk density of 0.6 g cm⁻³ 
(Supplementary Fig. 4) and extrapolated to one year. Using a drainage 
basin of 4.2 km² (about 10% of the canyon flank trawled during 2010) 
for the monitored tributary valley, approximately 2.4 × 10⁻⁵ km³ is 
estimated to have been exported from this fishing ground annually. 
Assuming an equal sediment yield since the 1970s, following the 
industrialization of the trawling fleet'’, a total of around 1.0 X 10 “km* 
has potentially been removed from this canyon flank by bottom trawl- 
ing, contributing to its morphological change. The actual volume 
remobilized over this period could easily be larger, taking into account 
that the surface sediments from the fishing ground are at present over- 
consolidated (Supplementary Fig. 4) and that downslope sediment 
fluxes in the past, when the morphology was rougher and the surface 
sediments unconsolidated and easier to resuspend, could have been 
much higher. Although the estimated annual volume is similar to the 
annual sediment volumes involved in canyon head failures, the total 
estimated volume remobilized from this canyon flank over four decades 
is two to five orders of magnitude smaller than the sediment volumes 
involved in open-slope submarine failures that occur over geological 
periods of time (Supplementary Table 1). 
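The mass-to-volume conversion described in this paragraph can be checked with a few lines of Python. This is a sketch under stated assumptions: the exported mass is taken as 5.4 × 10³ tonnes (the exponent is damaged in this copy of the text, so that value is an assumption), while the 136-day period and the 0.6 g cm⁻³ dry bulk density come from the text. The function name is illustrative.

```python
def annual_volume_km3(mass_tonnes, period_days, dry_bulk_density_g_cm3):
    """Convert a sediment mass exported over a period to an annual volume in km^3."""
    density_t_m3 = dry_bulk_density_g_cm3  # 1 g cm^-3 equals 1 tonne m^-3
    volume_m3 = mass_tonnes / density_t_m3
    annual_m3 = volume_m3 * 365.0 / period_days
    return annual_m3 / 1e9  # 1 km^3 = 1e9 m^3


# The mass exponent is an assumption; period and density are from the text.
annual = annual_volume_km3(mass_tonnes=5.4e3, period_days=136,
                           dry_bulk_density_g_cm3=0.6)
print(f"{annual:.2e} km^3 per year")
```

Under that assumption the result is of the order of 2.4 × 10⁻⁵ km³ per year, which is in line with the annual figure quoted above.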

Recent studies assessing the impact of anthropogenic activities in 
the marine ecosystems have already highlighted the effects caused by 
bottom trawling, both at global scale” and in deep-sea environments, 
where its spatial extent has been estimated to be an order of magnitude 
greater than the total extent of all other anthropogenic activities”. 
Given that deep-sea bottom trawling has spread around the globe 
during the past decades*'*'® (Fig. 1), the present study raises questions 
over whether bottom trawling is becoming an important driver of the 
upper continental slope seascape evolution during the Anthropocene™, 
at a scale that has been previously unnoticed. 

The relevance of bottom trawling in the alteration of natural sedi- 
ment fluxes over continental shelves has already been pointed out®*, 
but our results stress that, in addition to altering deep-sea sedi- 
ment fluxes, bottom trawling also modifies the shape of the upper 
continental slope, particularly in regions that are strongly affected by 
this fishing technique. In these portions of the margin, the morpho- 
logical complexity, as well as benthic habitat heterogeneity, has been 
drastically reduced, potentially affecting species diversity by regulating 
levels of competition, predation and physiological stress**. Such 
seascape alteration could also affect ecosystem functioning, because 
deep-sea ecosystems have evolved in these environments to rely upon 
the original morphological complexity and biogeochemical properties 
of untrawled surface sediments to support benthic organisms. 

This process of seascape spatial levelling has many parallels with 
landscape evolution under anthropogenic pressures. Changes in land 
use during the twentieth century resulted in a reduction of landscape 
diversity almost everywhere through the spreading of arable land and 
timber plantations, which led to the gradual disappearance of both 
major and minor elements of topography”. Bottom trawling has been 
compared to forest clear-cutting”, although our results suggest that a 
better comparison might be intensive agricultural activities. Trawled 
continental-slope environments are the underwater equivalent of a 
gullied hill slope on land, part of which has been transformed into 
crop fields that are ploughed regularly, thus replacing the natural 
contour-normal drainage pattern by levelled areas with a smaller-scale 
contour-parallel alignment of troughs and crests (Fig. 4, Supplementary 
Fig. 5). Differences exist in the processes that operate in submarine 
and subaerial environments. Although farmers usually plough their 
land a few times per year, at sea trawling can occur on a nearly daily 
basis (Fig. 3, Supplementary Fig. 1). Also, agricultural activities do not 
generate the periodic sediment gravity flows that contribute to remove 
material from fishing grounds. However, the frequent repeated 
trawling (ploughing) over the same ground, involving displacement 
of sediments owing to mechanical redistribution, is what ultimately 
causes the levelling of the surface and produces morphological effects 
similar to those of a farmer’s field. 

Warming, sea-level rise, acidification and changes in ocean circula- 
tion have been listed as the major present and foreseeable human foot- 
prints in the ocean”’. This work argues for the inclusion in that list of the 
modification of the submarine landscape by trawling, along with the 
ecological and biogeochemical consequences that may derive from it. 
Human-induced changes on Earth’s geomorphology, already well 
documented for emerged land”, are also present in submerged regions. 


METHODS SUMMARY 


This study was conducted using a unique, dedicated experimental strategy com- 
bining monitoring of contemporary sediment transport processes, sediment cor- 
ing, remote operated vehicle (ROV) in situ observations and high-resolution 
multibeam bathymetric surveys, coupled with the information provided by sat- 
ellite-based navigation tracks of fishing vessels. Swath bathymetry data from La 
Fonera (also named Palamos) Canyon were obtained during the EUROLEON 
cruise onboard RV Hespérides in October 2007 using two Simrad multibeam 
echosounders: EM-1002S in shallow and EM-120 in deep waters. A bathymetric 
grid of 15-m resolution was generated to obtain the Digital Terrain Model used in 
this study. Contemporary sediment transport processes were measured by an 
instrumented mooring line equipped with a downward-looking 300 kHz acoustic 
Doppler current profiler (ADCP) from Teledyne RDI and three turbidimeters 
(AQUA logger 520 from the Aquatec Group) deployed from 10 May to 22 
September 2011 on the northern flank of La Fonera Canyon, at 980 m depth within 
the Montgri tributary valley axis. Seafloor sediment samples from the canyon 
flanks were collected using a six-tube multicorer from KC-Denmark during the 
HERMIONE-1 cruise onboard the RV Garcia del Cid. A vertical slab of undis- 
turbed sediment was removed from one of the core tubes for X-ray analysis. A 
second core tube was sliced at 1-cm intervals for granulometric analyses. Bottom 
video imagery of La Fonera canyon flanks was recorded in July 2011 during the 
PROMARES-OASIS DEL MAR research cruise onboard RV Sarmiento de 
Gamboa by using a high-definition Kongsberg OE14-502A camera mounted on 
the 2000-m-depth-rated SubAtlantic Super Mohawk 24 ROV Liropus belonging to 
the Spanish Institute of Oceanography. The satellite-based tracking of fishing 
trawlers operating in the study area from the Vessel Monitoring Systems (VMS) 
data for 2007-2010 was provided by the Fishing Monitoring Centre of the Spanish 
General Secretariat of Maritime Fishing (SEGEMAR). 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Time series observations. An instrumented mooring line equipped with a 
downward-looking 300 kHz acoustic Doppler current profiler (ADCP) from 
Teledyne RDI placed at 82 m above sea floor (m.a.s.f.) and three Seapoint 
turbidimeters (AQUA logger 520 from the Aquatec Group) placed at 5, 20 and 
50 m.a.s.f., was deployed from 10 May to 22 September 2011 in the axis of the 
Montgri tributary valley on the northern flank of La Fonera Canyon (41° 52.49’ N; 
3° 20.66’ E), at 980 m water depth, slightly deeper than the maximum trawling 
depth (see position in Figs 2 and 4). Turbidimeters were programmed to measure 
Formazin Turbidity Units (FTU) at 1-min sampling intervals in auto-gain mode, 
and FTU readings were converted to estimates of suspended sediment concentra- 
tion (SSC) according to ref. 31 using standard methods. The ADCP was pro- 
grammed to sample at 5-min intervals with 60 pings per ensemble at minimum 
time ping (standard deviation 0.9 cm s⁻¹) and a depth cell size of 2 m, and 
recorded data until 23 September 2011. Owing to acoustic interference with the 
steep sea floor, the lowest cell (that is, the one closest to the bottom) with reliable 
current data was at 12 m.a.s.f. and the highest cell was at 80 m.a.s.f. 

Sediment fluxes. To estimate the down-slope sediment transport at the mooring 
location, the north and east current components from each ADCP cell were 
transformed to along-valley and across-valley current components, using an 11° 
clockwise rotation from north as the orientation of the tributary valley axis. 
Downslope sediment fluxes were then obtained by multiplying the along-valley 
current velocity from each ADCP cell by the estimate of SSC at the same depth 
level, obtained from a linear regression between SSC measurements from 5 to 
20 m.a.s.f., and from 20 to 50 m.a.s.f., averaged at 5-min intervals. Currents from 
12 m.a.s.f. were assumed to be the same down to 4 m.a.s.f. to calculate near-bottom 
fluxes, considering SSC measurements from 5 m.a.s.f. and the currents from 4 to 
6 m.a.s.f. to be the lowest cell. The time-integrated cumulative transport through 
each cell, computed as tonnes per square metre, was multiplied by its height (that 
is, 2 m) and the valley width at the water depth of the cell to obtain the mass of 
sediment that passed through the various levels (that is, from 4 to 50 m.a.s.f.). The 
integration of the transport at all levels gave an estimate of the amount of sediment 
that was exported downslope through the tributary valley during the observational 
period (136 days). This value should be considered a minimum estimate, because 
the flux above 50 m.a.s.f. and, more importantly, the flux from 4 m.a.s.f. to the sea 
floor, where most of the sediment transport probably occurs, could not be 
accurately calculated. 
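The flux calculation described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' processing code: the function names are hypothetical, SSC is assumed to be in g m⁻³ and velocity in m s⁻¹, the "linear regression" between turbidimeter heights is treated as piecewise-linear interpolation, and the positive along-valley direction is taken to be the 11° bearing (whether that is down-slope depends on the local geometry).

```python
import math

VALLEY_AXIS_DEG = 11.0  # valley axis bearing, clockwise from north (from the text)


def rotate_to_valley(north, east, axis_deg=VALLEY_AXIS_DEG):
    """Project north/east velocities onto along- and across-valley axes."""
    theta = math.radians(axis_deg)
    along = north * math.cos(theta) + east * math.sin(theta)
    across = east * math.cos(theta) - north * math.sin(theta)
    return along, across


def ssc_at(height_m, sensors):
    """Piecewise-linear SSC between turbidimeter heights.

    sensors: list of (height_masf, ssc) pairs. Heights outside the sensor
    range are clamped to the nearest sensor (an assumption of this sketch).
    """
    pts = sorted(sensors)
    if height_m <= pts[0][0]:
        return pts[0][1]
    if height_m >= pts[-1][0]:
        return pts[-1][1]
    for (h0, c0), (h1, c1) in zip(pts, pts[1:]):
        if h0 <= height_m <= h1:
            return c0 + (c1 - c0) * (height_m - h0) / (h1 - h0)


def cell_mass_tonnes(along_m_s, ssc_g_m3, dt_s, cell_height_m, valley_width_m):
    """Mass through one cell: sum(u * C * dt) * cell height * valley width."""
    transport_g_m2 = sum(u * c * dt_s for u, c in zip(along_m_s, ssc_g_m3))
    return transport_g_m2 / 1e6 * cell_height_m * valley_width_m
```

Summing `cell_mass_tonnes` over all cells between 4 and 50 m.a.s.f. would give the kind of minimum export estimate discussed in the text; the valley width for each cell has to come from the bathymetry.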

Multibeam bathymetry. Swath bathymetry data were obtained during the 
EUROLEON cruise onboard RV Hespérides in October 2007 using two 
SIMRAD multibeam echosounders: an EM-1002S for shallow depths and an 
EM-120 in deep waters. The EM-1002S works at a frequency of 95 kHz with 
111 beams per ping, whereas the EM-120 works at a frequency of 12 kHz with 
191 beams per ping. Both were operated in equidistant mode, with swath width 
independent of water depth and fixed to 500 m. Data were logged using
Kongsberg’s Seafloor Information System (SIS). Processing of both multibeam 
data sets was conducted using Caris HIPS and SIPS software (http://www.caris.com/products/hips-sips/)
and included corrections for ship motion, sound speed
and spurious data. A bathymetric grid of 15-m resolution was generated to obtain 
the final Digital Terrain Model and slope gradient map illustrated in Fig. 4, and a 
50-m bathymetric grid subsample was used to perform the rugosity analysis shown 
in Supplementary Fig. 2. 

Trawlers’ satellite-based tracking. Within the framework of the Common 
Fisheries Policy of the European Union, the Fishing Monitoring Centre of the 
Spanish General Secretariat of Maritime Fishing (SEGEMAR) continuously tracks 
about 1,400 fishing vessels equipped with and operating satellite-based Vessel
Monitoring Systems (VMS), commonly referred to as ‘blue boxes’. Blue boxes
are standardized, protected and sealed against power cuts and the marine
environment, and designed to prevent any manipulation. Each blue box consists of a positioning
unit and a satellite-based transmission-reception station, along with data logging 
and storage devices. VMS uses the Global Positioning System and Inmarsat-C for 
positioning and data transmission, respectively. Data are stored in the VMS 
internal memory for six months. Blue boxes also have supplementary buttons
with which to select the vessel’s activity at any time. The VMS provides informa- 
tion on a vessel’s position, direction and speed and is able to determine each 
vessel’s activity (that is, fishing, sailing, drifting or docked) at any time regardless 
of whether it is located in authorized or prohibited waters. The error margin in the 
positioning is less than 100 m and the maximum time between the transmission of 
a message and its reception at the Fishing Monitoring Centre is 10 min. The route 
followed by each vessel is known and its speed reveals whether it is sailing or 
fishing. For the blue and red deep-sea shrimp Aristeus antennatus fishery, 
sustained vessel speeds of less than 3.5 knots imply that the vessel is fishing. The 
VMS information represented in Fig. 4b corresponds to 55 different VMS iden- 
tification codes (that is, vessels) for the period extending from 1 January 2007 to 31 
December 2010. The VMS information in Fig. 4b was filtered by depth and speed. 
Vessel velocities higher than 3.5 knots and depths shallower than 130 m (that is, 
the mean depth of the continental shelf edge in the study area) were removed to 
illustrate better the actual fishing activity in slope waters. Each vessel heading has 
also been illustrated in Fig. 4b by means of small, course-oriented arrows. 
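The depth and speed filter described above is simple enough to state as code. The following is a hedged sketch only: the record layout (`VmsRecord` and its field names) is hypothetical and does not reflect the actual VMS data format; only the two thresholds (3.5 knots and 130 m) come from the text.

```python
from dataclasses import dataclass

MAX_FISHING_SPEED_KN = 3.5   # sustained speeds below this imply fishing (from the text)
SHELF_EDGE_DEPTH_M = 130.0   # mean shelf-edge depth in the study area (from the text)


@dataclass
class VmsRecord:
    # Hypothetical record layout, for illustration only.
    vessel_id: str
    speed_kn: float
    depth_m: float      # water depth at the reported position, positive down
    heading_deg: float


def slope_fishing_records(records):
    """Drop positions faster than 3.5 knots or shallower than 130 m."""
    return [
        r for r in records
        if r.speed_kn <= MAX_FISHING_SPEED_KN and r.depth_m >= SHELF_EDGE_DEPTH_M
    ]
```

The surviving records keep their headings, which is the information the course-oriented arrows in Fig. 4b are drawn from.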

Bottom sediment samples. Seafloor sediment samples were collected in La
Fonera Canyon using a six-tube KC multicorer during the HERMIONE-1 cruise 
onboard the RV Garcia del Cid. A vertical slab of undisturbed sediment was 
removed from one of the core tubes for X-radiographic analysis. A second core 
tube was sliced at 1 cm intervals for further analyses. Dry bulk density was deter- 
mined as the ratio of freeze-dried weight to volume of wet sediment slice. For 
granulometric analysis, about 1 g of lyophilized sediment sample was treated with 
20% H2O2 to remove organic matter and then dispersed by adding sodium
polyphosphate solution. Grain size analysis of the prepared sample was conducted 
by laser diffraction using a Horiba Partica LA-950V2 particle-size analyser. The 
LA-950V2 detects particles in the range 0.01-3,000 µm, so sediment samples were
sieved through a 2,000-µm mesh screen before analysis and the grain size distribution
obtained by laser diffraction was corrected to include the relative percentage
of gravel.

Remote operated vehicle imagery. Bottom video imagery of trawled and 
untrawled sections of La Fonera Canyon flanks was recorded in July 2011 during
the PROMARES-OASIS DEL MAR research cruise onboard the Spanish RV 
Sarmiento de Gamboa by using a high-definition Kongsberg OE14-502A camera 
mounted on the 2,000-m-depth-rated Sub-Atlantic Super Mohawk 24 ROV 
Liropus belonging to the Spanish Institute of Oceanography. Single frames were 
subsequently extracted from the video records using Videolan (VLC version 1.1.11 
The Luggage) and Avidemux (version 2.5.4) media player software. Underwater 
positioning was ensured by a high-precision HiPAP 350P acoustic system linked 
to the DGPS positioning system of the vessel. Position accuracy of the HiPAP 350P 
is 0.3% of the range and its range detection accuracy is less than 20 cm. 


31. Guillén, J., Palanques, A., Puig, P., Durrieu de Madron, X. & Nyffeler, F. Field
calibration of optical sensors for measuring suspended sediment concentration 
in the western Mediterranean. Sci. Mar. 64, 427-435 (2000). 

Stereocontrolled organocatalytic synthesis of
prostaglandin PGF2α in seven steps


Graeme Coulthard, William Erb & Varinder K. Aggarwal


Prostaglandins are hormone-like chemical messengers that 
regulate a broad range of physiological activities, including blood 
circulation, digestion and reproduction’*. Their biological 
activities and their complex molecular architectures have made 
prostaglandins popular targets for synthetic organic chemists for 
over 40 years**. Prostaglandin analogues are widely used as 
pharmaceuticals and some, such as latanoprost, which is used to 
treat glaucoma®®, have become billion-dollar drugs. Previously 
reported syntheses of these compounds are quite lengthy, 
and every chemical step costs time and energy, generates waste 
and is accompanied by material losses. Using a new bond discon- 
nection, here we report a concise synthesis of the most complex 
prostaglandin, PGF2α, with high levels of control of relative and
absolute stereochemistry, and fewer steps. The key step is an aldol 
cascade reaction of succinaldehyde using proline organocatalysis 
to create a bicyclic enal in one step and an enantiomeric excess of 
98%. This intermediate bicyclic enal is fully primed with the appro- 
priate functionality for attachment of the remaining groups’. 
Access to this bicyclic enal will not only render existing 
prostaglandin-based drugs more affordable, but will also facilitate 
the rapid exploration of related chemical structures around 
the ubiquitous five-membered ring motif, such as potentially 
therapeutic prostaglandin analogues. 

Prostaglandins, such as PGF2α (Fig. 1; 1), are hormones that are
responsible for the control of a myriad of essential biological processes 
from sleep to pain, fever to inflammation, menstruation to birth, and 
constriction of blood vessels to blood clotting'**. They are derived from 
arachidonic acid and are transformed by prostaglandin synthetase into 
a number of structurally related carbocyclic molecules. These sensitive 
and labile molecules are not stored in the body but are synthesized in 
response to stimuli. They were discovered in the early 1930s by von 
Euler? and by the mid-1960s the structures of the first family of 
prostaglandins were uncovered by Bergström et al. (see refs 2 and 10
for reviews). The complex structures of prostaglandins, together with
their broad spectrum of biological activity, fuelled intense research
activity into their synthesis, comparable to that generated from
β-lactam antibiotics and steroids. Woodward"', Corey'’, Stork’’,
Noyori'*, Danishefsky'? and many others contributed ingenious 
strategies and developed new methodologies of general use in the con- 
struction of these complex molecules*°. But prostaglandins and their 
analogues are not only of academic interest; they have also found their
way into a considerable number of pharmaceuticals. For example, an
analogue of PGF2α (1), latanoprost (2), is used in the treatment of
glaucoma’®, and achieved sales of $1.75 billion in 2010 (ref. 17). 

The manufacture of latanoprost® requires 20 steps and uses the 
original strategy developed by Corey et al.'* in the synthesis of the 
related prostaglandin, PGF2α. Corey’s synthesis involved the formation
of a key intermediate, the Corey lactone (3), in nine steps from
cyclopentadiene (4). From this lactone, they were able to assemble
the entire family of prostaglandins’’, and specifically PGF2α, in eight
further steps’®. 

In our analysis for the synthesis of prostaglandins and in common 
with other syntheses of PGF2α'>'>'? we recognized that lactol 5 was an
ideal late-stage intermediate because it enabled the incorporation of 
the upper side-chain with control of double-bond geometry through a 
Wittig reaction. At this point our retrosynthetic analysis departed from 
all previous syntheses. We were interested in disconnecting the C12-C13
bond”, because we recognized that the bicyclic lactol could
control the stereochemistry of the 1,4-addition of the lower side-chain 
onto a suitable Michael acceptor 6. By selecting an aldehyde as the 
electron withdrawing group of the Michael acceptor, the enal 7 could 
be disconnected back to a simple aldol dimerization of succinaldehyde 
(8), a process that could be rendered asymmetric through proline (9) 
catalysis”?”?. 

Figure 1 | Prostaglandins in nature and medicine. a, General schematic of
Corey’s synthesis of PGF2α. b, The molecular structure of latanoprost (2), a
PGF2α analogue used for the treatment of glaucoma. c, Our retrosynthetic
analysis of PGF2α. The numbered carbon atoms are referred to in the main text.
FGI, functional group interconversion. Ph, phenyl. EWG, electron withdrawing
group. i-Pr, isopropyl, CH(CH3)2.

Figure 2 | Potential reaction pathways of the proline-catalysed aldol
reaction of succinaldehyde. The desired pathway is shown in blue; others are 
labelled (i) to (iii) and described in the main text. 


However, the ‘simple’ concept shown in the retrosynthesis belies a 
highly complex reaction cascade (blue, Fig. 2) with many potential 
pitfalls (black, Fig. 2): (i) the aldol product 10 is required to form 
the less favoured hemiacetal 11, bearing cis substituents in the 
5-membered ring, but not the trans hemiacetal 12; (ii) the hemiacetal 
11 is required to undergo an intramolecular second aldol and eliminate 
to give 7 but aldol 10 itself should not eliminate to give 13; (iii) aldol 10 
is a reactive trialdehyde which will be prone to undergo further aldol 
reactions with succinaldehyde (8) or with itself, leading to 14 and 15 
and ultimately oligomers. 

Figure 3 | Model studies. a, Model studies to investigate the intermolecular
proline-catalysed aldol reaction of aldehydes bearing an ester at the 4-position. 
b, Model studies to investigate an intramolecular aldol reaction and 
dehydration. c, A transition state structure accounting for the observed 
enantioselectivity. 

Perhaps unsurprisingly, the desired aldol proved highly challenging
to realize in practice: treatment of succinaldehyde with proline in a 
range of solvents and under a variety of conditions did not deliver any 
of the desired bicyclic product but instead gave oligomeric material. 
We therefore deconstructed the reaction cascade to determine 
which of the two steps, the initial aldol step or the second aldol and 
dehydration step, was causing the problem. 

Model aldehyde 16 was treated with proline (9) and aldol product 
17 was obtained as a 3.6:1 mixture of diastereoisomers in moderate 
yield (Fig. 3a)”*. This showed that aldehydes bearing
a carbonyl group in the 4-position are suitable substrates for the
proline-catalysed aldol reaction.

To test the second aldol and dehydration step, model dialdehyde 18 
was prepared by ozonolysis of the known lactone 19 (Fig. 3b)*.
However, treatment of this dialdehyde with proline (catalyst A) 
provided only low conversion to the expected enal 20, clearly indi- 
cating that the second step was the hurdle in the reaction cascade. We 
therefore explored alternative catalysts and found that [Bn2NH2][OCOCF3]
(21, catalyst B)** was much more effective, giving enal 20 in 51% 
isolated yield (from 19). 

We therefore tested the aldol reaction cascade of succinaldehyde 
with a combination of proline and [Bn2NH2][OCOCF3] in a range of
solvents and under a variety of conditions (see the Supplementary
Information for full details). This time the reaction was successful;
selected data, illustrative of the process, are summarized in Table 1.
The reaction could be conducted at (unusually) low loadings of both
catalysts and at relatively high concentration (2 M) (entries 1-3 in
Table 1). The sequenced addition of the two catalysts and the timing
were critical to the success of the reaction: if [Bn2NH2][OCOCF3] was
present at the outset (entry 4 in Table 1), or not added at all (see earlier 
discussion), the reaction failed. We spent some time investigating this 
facet of the reaction because the low yield is a consequence of 
oligomerization of the intermediate trialdehyde. As the concentration 
of the trialdehyde builds up it is prone to undergo further aldol 
reactions with succinaldehyde, leading to oligomers. The second 
catalyst seemed to inhibit the initial proline-catalysed aldol reaction, 
so consecutive addition of the two catalysts was required. The 
optimum time to add the second catalyst was found to be dependent 
on the amount of proline catalyst used: using just 2% proline, the yield 
of the lactol 7 increased with increasing time before the addition of the 
second catalyst (entries 5-8 in Table 1), peaking at around 10 h. The
reaction could also be conducted with just 1% proline but considerably 
longer reaction times were required (entry 9 in Table 1). 


Table 1 | Effect of catalyst loading and time delay on yield

Reaction: succinaldehyde 8 to lactol 7. (i) (S)-proline, THF (2 M), room temperature; (ii) [Bn2NH2][OCOCF3], THF (1 M), room temperature, 14 h.

Entry   (S)-proline (mol.%)   Time (h)   Yield (%)
1       10                    2          14
2       5                     2          16
3       5                     4          19
4       2                     0          ~2
5       2                     4          10
6       2                     6          16
7       2                     10         20
8       2                     24         20
9       1                     24         18

Reactions were carried out on 200 mg of succinaldehyde 8. ‘Time’ refers to the time before
[Bn2NH2][OCOCF3] (2 mol.%) was added and the reaction was diluted to 1 M. The yield is the NMR
(nuclear magnetic resonance) yield based on an internal standard (1,3,5-trimethoxybenzene); results
are an average of five reactions.

Figure 4 | A concise asymmetric synthesis of PGF2α (1). Reaction conditions
are as follows (see the Supplementary Information for full details). (i), H2O, at
75 °C for 4 h, then at 115 °C to distil MeOH and H2O. (ii), (S)-proline
(2 mol.%), THF (2 M), at room temperature for 20 h, then [Bn2NH2][OCOCF3]
(2 mol.%), THF (1 M), at room temperature for 14 h. (iii), MeOH
(2.0 equivalents), amberlyst 15, MgSO4, CH2Cl2, at room temperature for 14 h.


The reaction was conducted at relatively high concentration, which 
greatly assisted the scale-up of the reaction. Despite the low yield, 
work-up and purification of the enal was straightforward as the 
oligomeric side products could be largely removed by filtration, leaving 
a relatively pure crude material. Partial purification through a plug of 
silica gave the desired enal 7 in about 16% yield and with an enantiomeric 
ratio of 99:1, on the multigram scale. The absolute stereochemistry of 
the product was ultimately established through the synthesis of PGF2α
(1) and follows from the List-Houk model for this type of reaction 
(transition-state structure 22, Fig. 3c)”°. A mixture of diastereoisomers 
of trialdehyde 10 was expected to be formed (a diastereoisomeric 
ratio of about 3.6:1 was expected from results with the model com- 
pound 16) but as the minor diastereoisomer cannot give the alternative 
diastereomeric enal it must be consumed by the formation of oligomers. 
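The enantiomeric ratio of 99:1 quoted here and the enantiomeric excess of 98% quoted in the abstract are two expressions of the same measurement. A short sketch of the standard conversion follows; the function name is illustrative and not taken from the paper.

```python
def ee_from_er(major, minor):
    """Enantiomeric excess (%) from an enantiomeric ratio major:minor."""
    return 100.0 * (major - minor) / (major + minor)


# An enantiomeric ratio of 99:1 corresponds to 98% ee.
print(ee_from_er(99, 1))
```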

The complete synthesis of PGF2α from commercially available
materials is shown in Fig. 4. Heating 2,5-dimethoxytetrahydrofuran 
23 (41 g) in water followed by evaporation and extraction gave crude 
dialdehyde”” which was directly subjected to the aldol reaction as 
described above. The hemi-acetal 7 was converted into a 2:1 incon- 
sequential diastereoisomeric mixture of methoxy acetals 24 which 
were carried through the subsequent reaction sequence. On a large 
scale (285 g of 23), further modifications were required owing to the 
partial decomposition of the dialdehyde during the extended time 
needed to remove the larger volume of solvent used for extraction. 
Instead, we found that after distillation of the methanol (MeOH) and 
most of the water, azeotropic removal of the remaining water using 
2-MeTHF (2-methyltetrahydrofuran) provided a solution of the dialdehyde
in a solvent (2-MeTHF) that could be used directly in the aldol
reaction. Using this improved and simplified protocol, from 109.5 g of
inexpensive succinaldehyde we were able to obtain 15.0 g of pure
methoxy acetal 24 in 14% yield over two steps. Conjugate addition of
the mixed vinyl cuprate’* 25 to 4.0 g of methoxy acetal 24 followed by
trapping with TMSCl (trimethylsilyl chloride) furnished the silyl enol
ether 26. Subsequent controlled ozonolysis followed by treatment with
NaBH4 gave the alcohol 27 (4.6 g, 49% over two steps). As planned,
these two steps occurred with complete stereocontrol at the newly 
created stereogenic centres. Finally, simultaneous deprotection of the
acetal and silyl ether with aqueous HCl followed by Wittig reaction with
the phosphonium salt 29 (ref. 29) gave 1.9 g of the target molecule
PGF2α (1), which was identical in all respects to the natural product”.

Figure 4 (continued). (iv), 25 (1.1 equivalents), THF, then Me3SiCl, Et3N. (v), O3, CH2Cl2/MeOH
(3:1), −78 °C, then NaBH4 (3 equivalents), at −78 °C to room temperature.
(vi), 1.5% aqueous HCl/THF (3:2), at room temperature for 16 h. (vii),
(4-carboxybutyl)(triphenyl)phosphonium bromide (6 equivalents), potassium
tert-butoxide (12 equivalents), THF, at 0 °C to room temperature. TBDMS,
tert-butyldimethylsilyl.

Thus, we have developed a short (seven-step) synthesis of
prostaglandin PGF2α from inexpensive 2,5-dimethoxytetrahydrofuran
23. The key step is an organocatalytic aldol dimerization reaction
of succinaldehyde, which generates the bicyclic enal 7 in high
enantiomeric ratio and fully primed with functionality suitable to
introduce the required side chains directly. The aldol cascade required
proline to perform the first aldol reaction and a second catalyst
([Bn2NH2][OCOCF3]) to induce an intramolecular aldol reaction
and elimination. Although the aldol reaction was low-yielding, the
enantioselectivity was very high, isolation and purification were
straightforward and the reaction could be conducted on the multigram
scale. Its application in a short synthesis of the most complex
of prostaglandins, PGF2α, has been demonstrated. Indeed, the bicyclic
enal 7 is an ideal building block not just for the cost-effective synthesis
of the whole family of prostaglandins, but also for exploring the
chemical space around the ubiquitous five-membered carbocyclic ring 
motif, where other biologically active molecules undoubtedly lie. 


METHODS SUMMARY 
Procedure for formation of methoxyacetal 24. A solution of succinaldehyde 8 
(109.5 g, 1.27 mol) in 2-MeTHF (650 ml) was stirred at room temperature (20 °C).
(S)-Proline (2.93 g, 25.4 mmol, 0.02 equivalents) was added and the reaction
stirred at room temperature for 20 h. THF (650 ml) was added, followed by
[Bn2NH2][OCOCF3] (8.43 g, 25.4 mmol, 0.02 equivalents). The reaction was
stirred for a further 14 h. Celite (60 g) was added to the reaction and the volume
of the reaction mixture was reduced by three quarters under reduced pressure.
tert-Butyl methyl ether (TBME) (975 ml) was added slowly and with vigorous stirring
of the mixture. The mixture was stirred for 20 min before filtration of the resulting
solids. The solids were washed with TBME (3 × 150 ml) and the filtrate was
concentrated under reduced pressure. The material was purified by column
chromatography (about 600 g silica), eluting with petrol/EtOAc (6:4 to 5:5), to give the
lactol 7 (as an approximately 2:1 mixture of diastereoisomers), as a brown oil.
This residue, containing 7, was dissolved in CH2Cl2 (190 ml) and stirred at
room temperature. MeOH (5.47 g, 6.90 ml, 170.6 mmol, 2.0 equivalents based
on about 13.4% of 7 detected by internal standard in the previous reaction) was
added via syringe. Amberlyst 15 (1.22 g) and MgSO4 (25.2 g) were added in one
portion and the reaction mixture was stirred at room temperature for 14 h. The
reaction mixture was filtered through a sinter funnel and the solids washed with
CH2Cl2 (3 × 60 ml). The filtrate was concentrated under reduced pressure and
purified by column chromatography (about 300 g silica), eluting with petrol/EtOAc
(9:1 to 4:1), to give the methyl acetal 24 (as an approximately 2:1 mixture
of diastereoisomers, 15.0 g, 14.0% (over two steps from succinaldehyde)) as a
yellow oil. See the Supplementary Information for characterization data. 
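The quoted two-step yield can be checked from the masses given in the procedure. The sketch below is a worked check rather than part of the published method. It assumes molecular formulas that are inferred from the chemistry described (two molecules of succinaldehyde, C4H6O2, combine with loss of one water to give enal 7, and conversion of the lactol into its methyl acetal gives 24 as C9H12O3); these formulas are not stated in the text.

```python
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "O": 15.999}  # standard atomic weights

SUCCINALDEHYDE = {"C": 4, "H": 6, "O": 2}
METHYL_ACETAL_24 = {"C": 9, "H": 12, "O": 3}  # assumed formula, see lead-in


def molar_mass(formula):
    """Molar mass (g/mol) of a formula given as {element: count}."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def two_step_yield_percent(succinaldehyde_g, acetal_g):
    """Yield of 24, given that two succinaldehyde molecules make one product."""
    mol_start = succinaldehyde_g / molar_mass(SUCCINALDEHYDE)
    mol_product = acetal_g / molar_mass(METHYL_ACETAL_24)
    return 100.0 * mol_product / (mol_start / 2.0)


# 109.5 g of succinaldehyde (about 1.27 mol) giving 15.0 g of 24
print(round(two_step_yield_percent(109.5, 15.0), 1))
```

Under these assumptions the result agrees with the 14.0% reported above, and 109.5 g of succinaldehyde corresponds to the stated 1.27 mol.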

Stabilization of Leidenfrost vapour layer by textured
superhydrophobic surfaces 


Ivan U. Vakarelski, Neelesh A. Patankar, Jeremy O. Marston, Derek Y. C. Chan & Sigurdur T. Thoroddsen


In 1756, Leidenfrost' observed that water drops skittered on a suffi- 
ciently hot skillet, owing to levitation by an evaporative vapour film. 
Such films are stable only when the hot surface is above a critical 
temperature, and are a central phenomenon in boiling’. In this 
so-called Leidenfrost regime, the low thermal conductivity of the 
vapour layer inhibits heat transfer between the hot surface and the 
liquid. When the temperature of the cooling surface drops below 
the critical temperature, the vapour film collapses and the system 
enters a nucleate-boiling regime, which can result in vapour explo- 
sions that are particularly detrimental in certain contexts, such as in 
nuclear power plants’. The presence of these vapour films can also 
reduce liquid-solid drag**. Here we show how vapour film collapse 
can be completely suppressed at textured superhydrophobic sur- 
faces. At a smooth hydrophobic surface, the vapour film still col- 
lapses on cooling, albeit at a reduced critical temperature, and the 
system switches explosively to nucleate boiling. In contrast, at 
textured, superhydrophobic surfaces, the vapour layer gradually 
relaxes until the surface is completely cooled, without exhibiting a 
nucleate-boiling phase. This result demonstrates that topological 
texture on superhydrophobic materials is critical in stabilizing the 
vapour layer and thus in controlling—by heat transfer—the liquid- 
gas phase transition at hot surfaces. This concept can potentially be 
applied to control other phase transitions, such as ice or frost forma- 
tion’ °, and to the design of low-drag surfaces at which the vapour 
phase is stabilized in the grooves of textures without heating”. 

Heat transfer and boiling on a heated solid surface in contact with 
liquid are familiar phenomena observed in our daily lives, and occur in 
many industrial processes. As the vessel containing a liquid is progres- 
sively heated, the liquid will start to boil, with vapour bubbles forming 
at the hot surface. This is the nucleate-boiling regime. If the surface 
temperature is increased further, a continuous vapour film will form 
that leads to the film-boiling or Leidenfrost regime. It is known that 
surface properties such as roughness and wettability can alter the 
transition temperature from the nucleate-boiling to the film-boiling 
regime’’'®. As the low thermal conductivity of the vapour layer 
dramatically reduces the efficiency of heat transfer, the traditional 
focus is to increase the temperature at which film boiling occurs”. 
However, if the vapour layer is to be used for drag reduction**'”, 
lowering the temperature of film boiling is beneficial, as the presence of 
a Leidenfrost vapour layer can reduce the hydrodynamic drag force on 
a heated sphere moving in liquid by up to 85%°. 

Takata et al.'* have reported that the initiation of nucleate boiling 
can be suppressed at what they termed “super-water-repellent” sur- 
faces. Here we show that a textured superhydrophobic surface can 
eliminate the collapse of the vapour film, thus making it possible to 
maintain a stable vapour film at all temperatures above the boiling 
point of the liquid—that is, at all superheats. To this end, we investi- 
gated cooling and heating phenomena of heated surfaces that possess 
different degrees of hydrophobicity in water. 


In our cooling experiments we use 20-mm-diameter stainless-steel 
spheres, and in our heating experiments 24-mm-diameter stainless- 
steel cylinders and stainless-steel vessels (capacity 60 ml). The stain- 
less-steel surfaces are modified to have superhydrophilic, hydrophilic, 
hydrophobic or superhydrophobic wetting properties, as characterized 
by water-drop contact angle in air. Hydrophilic surfaces were pro- 
duced by cleaning the steel surface with organic reagents, resulting 
in water contact angles of <30°. Hydrophobic surfaces were produced 
by silanization with trichloro(1H,1H,2H,2H-perfluorooctyl)silane, 
giving contact angles of ~100°. Superhydrophobic surfaces with water 
contact angles of >160° were produced by treatment with a commercial 
coating agent (Glaco Mirror Coat ‘Zero’, Soft 99 Co.) containing 
nanoparticles and organic reagent. Superhydrophilic surfaces, with 
contact angles of <10°, were produced from the superhydrophobic 
surface by plasma cleaning to remove the organic coating, while 
retaining the same texture and geometric morphology (see 
Supplementary Information for details). 

Figure 1a is an atomic-force micrograph of the superhydrophobic
surface coating, showing its rough surface topography. The very high 
contact angle subtended by a superhydrophobic sphere at the air- 
water meniscus is shown in Fig. 1b. When the sphere is fully immersed 
in water, its surface has a silver, mirror-like sheen, caused by reflection 
from the thin air layer retained on the surface. This is a signature of the 
Cassie state’!*, in which water is in physical contact only with the 
roughness peaks created by the coating (Fig. 1c). This type of air 
entrapment, called plastron, has been shown to reduce hydrodynamic 
drag on spheres moving in water”, and is used by some aquatic insects 
to facilitate underwater breathing”. 

We study the cooling of superhydrophilic, hydrophilic, hydro- 
phobic and superhydrophobic spheres that have been pre-heated in 
a temperature-controlled furnace and then immersed in a glass tank 
filled with water at 22 °C (Supplementary Fig. 4). The state of the water 
at the sphere surface during subsequent cooling is captured on high- 
speed video, and the sphere temperature is monitored by an inserted 
thermocouple-probe thermometer. The progress of the cooling events 
for the hydrophilic, hydrophobic and superhydrophobic spheres can 
be seen in Supplementary Movies 1, 2 and 3, respectively. The super- 
hydrophilic and hydrophilic spheres were heated to a maximum tem- 
perature of 700°C, but the maximum temperature to which the 
hydrophobic and superhydrophobic spheres could be raised was 
limited to ~400°C by the thermal stability of the organic coating. 
For the superhydrophilic sphere, the maximum temperature was not 
high enough to initiate film boiling; when the sphere came into contact 
with the water, it cooled almost instantaneously, accompanied by a 
vigorous release of bubbles. For all other surfaces, the initial sphere 
temperatures were high enough to be in the film-boiling regime, in 
which a continuous vapour layer coated the spheres, with a single 
bubble dome formed on the top of the sphere by buoyancy effects. 
During the initial stage of cooling, the bubble dome would grow and 

Figure 1 | Surface topography and images of the superhydrophobic sphere.
a, Three-dimensional representation of an atomic-force micrograph of a 10-µm
region of the superhydrophobic surface coating. The combination of high
surface roughness with an organic coating provides a high water contact angle,
of >160°. b, Image of the superhydrophobic sphere held on the water surface,
showing the high contact angle. c, Image of the superhydrophobic sphere
immersed in water. The silver-mirror sheen of the immersed sphere is due to a
thin air layer retained on the sphere surface, and is the signature appearance of a
Cassie state of the water adjacent to the surface. See Supplementary 
Information for further details of the preparation and characterization of the 
superhydrophobic coating (Supplementary Figs 1, 2). 


pinch off periodically (Fig. 2a). On further cooling, the film-boiling 
mode on the hydrophilic sphere ended with collapse of the vapour 
layer, marked by an explosive release of bubbles (Fig. 2c). This was 
followed by a short phase of rapid cooling in the nucleate-boiling mode 
(Fig. 2d). Film boiling on the hydrophobic sphere also ended with an 
almost instantaneous collapse of the vapour layer. However, for the 
superhydrophobic sphere no vapour-layer collapse was observed. 
Instead, the vapour layer gradually relaxed over the entire sphere 
surface, and when cooling was complete a residual bubble dome 
remained on the sphere apex (Fig. 2b), and the sphere surface exhibited 
the silver sheen indicative of a final Cassie state of the liquid adjacent to
the sphere surface (Supplementary Fig. 3). 

Figure 3 shows the measured sphere temperature plotted against the 
immersion time. The superhydrophilic sphere (Fig. 3a) begins its cool- 
ing in the nucleate-boiling regime, and its temperature drops rapidly to 
the pool temperature. The cooling of the hydrophilic (Fig. 3a) and 
hydrophobic (Fig. 3b) spheres begins in the Leidenfrost film-boiling 
regime. In both cases, the vapour layer collapses at the Leidenfrost 
point, indicated by a sharp increase in cooling rate at ~420°C for 
the hydrophilic sphere, and at ~210°C for the hydrophobic sphere. 
However, for the superhydrophobic sphere, film boiling is maintained 
down to the pool temperature (22 °C) without vapour-layer collapse 
and transition to nucleate boiling (Fig. 3b). 
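The Leidenfrost point is identified above as the temperature at which the cooling rate increases sharply. One simple way to locate such a jump in a logged cooling curve is sketched below. This is an illustrative approach, not the authors' analysis: the function names and the `jump_factor` threshold are assumptions of the sketch.

```python
def cooling_rates(times_s, temps_c):
    """Cooling rate (°C per second, positive when cooling) for each interval."""
    return [
        (temps_c[i] - temps_c[i + 1]) / (times_s[i + 1] - times_s[i])
        for i in range(len(temps_c) - 1)
    ]


def collapse_temperature(times_s, temps_c, jump_factor=3.0):
    """Temperature at the start of the first interval whose cooling rate
    exceeds jump_factor times the rate of the preceding interval.

    Returns None when no such jump occurs, as would be expected for a
    curve that relaxes smoothly without vapour-layer collapse.
    """
    rates = cooling_rates(times_s, temps_c)
    for i in range(1, len(rates)):
        if rates[i - 1] > 0 and rates[i] > jump_factor * rates[i - 1]:
            return temps_c[i]
    return None
```

Real thermocouple traces are noisy, so in practice the curve would probably need smoothing before a threshold of this kind is applied.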

Cooling experiments at elevated water pool temperatures of 80 and 
100 °C confirmed the same trends as the room-temperature cooling 
experiments, with respect to the sphere hydrophobicity (Supplemen- 
tary Fig. 5 and Supplementary Movie 4). In fact, as long as the water 
contact angle at room temperature exceeded 140°, achievable at 

Figure 2 | High-speed camera snapshots of 20-mm steel spheres cooling in
water. a, Bubble pinch-off from the vapour dome of a hot, superhydrophobic
sphere cooling in the Leidenfrost regime. Water pool temperature is 100 °C;
sphere temperature is Ts = 200 °C. b, Final state of the same superhydrophobic
sphere, cooled to the pool temperature (Ts = 100 °C). c, d, Hydrophilic sphere
cooling in water at 100 °C, at the moment of the explosive transition from film
boiling to nucleate boiling (c, Ts = 275 °C) and during nucleate boiling
(d, Ts = 200 °C). Sphere cooling events in water at 22 °C are shown in
Supplementary Movies 1 (hydrophilic sphere), 2 (hydrophobic sphere) and 3
(superhydrophobic sphere). Supplementary Movie 4 shows hydrophilic and
superhydrophobic spheres cooling in water at 100 °C.


textured superhydrophobic surfaces, no Leidenfrost transition was 
observed (Supplementary Figs 7, 8). 

We performed heating experiments to further characterize surface 
properties at the boiling transition. A cartridge heater placed inside a 
stainless-steel cylinder (Fig. 4a and Supplementary Fig. 9) was immersed 
in a water-filled beaker sitting on a hot plate that heated the water to 
slow boiling, Tw = 100 °C. The surface temperature of the cylinder, Ts,
was measured with an inserted thermocouple at constant heat flux, by 
controlling the applied power to the heater. We acquired high-speed 
video sequences along with the temperature measurements. 

Figure 4d shows the measured variations of heat flux with surface 
temperature at steady state for the four types of surface wettability. As 
expected for the low values of heat flux used here, the heat exchange 
occurred in the nucleate-boiling regime (Fig. 4b and Supplementary 
Fig. 13) for the superhydrophilic and hydrophilic surfaces. However, 
as in the sphere-cooling experiments, the superhydrophobic surface 
was able to sustain a vapour layer in the Leidenfrost regime for all 
surface superheats. This is readily observed in both the appearance of 
the vapour layer and the magnitude of the increase in surface temper- 
ature with increasing heat flux (Fig. 4d). 

When the temperature of the superhydrophobic cylinder was in 
equilibrium with the ambient water (Ts = Tw = 100 °C), the cylinder’s
surface appeared smooth and shiny. With increasing heater power, the 
appearance of surface ripples® accompanied a gradual transition to the 
Leidenfrost regime (Fig. 4c, Supplementary Fig. 10 and Supplementary 
Movie 5). Additional experiments with the immersion heater (see 
Supplementary Fig. 11) showed that for the superhydrophobic surface 
there was no hysteresis between heating and cooling experiments. 

Figure 3 | Sphere temperature versus cooling time for 20-mm steel spheres
held in water at 22 °C. a, Superhydrophilic sphere (contact angle <10°, SHL) 
and hydrophilic sphere (contact angle ~30°, HL). b, Hydrophobic sphere 
(contact angle ~100°, HB) and superhydrophobic surface sphere (contact 
angle >160°, SHB). Arrows indicate the Leidenfrost film-boiling regime, 
Leidenfrost-point transition (LP) and nucleate-boiling regime. The 
Leidenfrost transition in the superhydrophilic case occurs above the 
maximum temperature shown in the figure. In the superhydrophobic case, the 
Leidenfrost state relaxes to the Cassie state without passing through the 
nucleate-boiling regime. 


Keeping the surface immersed in water for up to 24 hours did not alter 
the smooth transition to the Leidenfrost regime (Supplemen- 
tary Fig. 11). Although in such cases the surfaces appeared matte 
instead of shiny, suggesting that the vapour layer was not in a pure 
Cassie state, on heating, they returned smoothly to the Leidenfrost 
state (Supplementary Fig. 12). The same recovery property of 
superhydrophobic surfaces has been demonstrated recently using 
electrophoresis”®. 

For the hydrophobic surface, it was possible to maintain the 
Leidenfrost vapour regime only if the cylinder was overheated before 
immersion to ~350°C, and the heater power was maximized on 
immersion. When the surface temperature was allowed to fall below 
~170 °C, the Leidenfrost vapour layer collapsed, and the system switched to
the nucleate-boiling branch (Fig. 4d). 

Experiments with a stainless-steel cup placed on a hot plate showed 
the same behaviour as with the immersion heater (Supplementary 
Figs 14, 15). The good agreement among the cooling experiments 
and the two types of heating experiment confirms the geometry-
independence and universality of the superhydrophobic surface effect
in stabilizing the Leidenfrost vapour layer (compare Fig. 4d with 
Supplementary Figs 6 and 15). 

By combining results for cooling experiments at constant initial heat 
content and high initial temperatures, and heating experiments at 
constant heating power but over a range of lower superheat tempera- 
tures, we have shown that the mode of heat transfer at the solid-liquid 
interface can be controlled by a combination of the surface morpho- 
logy and surface chemistry of the solid surface. The observations for 
the hydrophilic and the hydrophobic surfaces are consistent with the 
standard transition between nucleate and film boiling”. The strong 

Figure 4 | Surface temperature versus heat flux in heating experiments.
a, Schematics of the immersion heater. b, c, High-speed camera snapshots of
the heater in boiling (100 °C) water for the case of (b) hydrophobic surface in
the nucleate-boiling regime, at surface temperature Ts = 106 °C; and
(c) superhydrophobic surface in the Leidenfrost regime, Ts = 210 °C.
d, Dependence of the heat flux on the heater surface temperature for four types
of surface in the nucleate-boiling and Leidenfrost film-boiling regimes.
Abbreviations as in Fig. 3.


effect of the superhydrophilic surface in increasing the Leidenfrost 
temperature and enhancing boiling heat transfer has also been investi- 
gated recently’. However, we now show that for textured super- 
hydrophobic surfaces the heat exchange always occurs in the 
presence of a Leidenfrost vapour layer, within the film-boiling regime. 
This behaviour has not been observed previously and represents a 
significant departure from the characteristic form of the classical boiling 
heat-transfer curve'®**” of heat flux versus surface superheat temper- 
ature. The currently accepted form of the boiling curve comprises a 
critical heat-flux maximum and a Leidenfrost-point minimum that 
characterizes the transition between nucleate and film boiling. But at 
a textured superhydrophobic surface, the heat flux is a smoothly 
increasing function of superheat temperature, owing to the vapour layer 
that is always maintained at such surfaces (Supplementary Fig. 17). 
The vapour layer on the superhydrophobic surface is sustained by 
the high surface roughness and porosity, coupled with the intrinsic, 
non-wetting hydrophobicity due to surface chemical treatment. These 
are the requisite conditions for the existence of a large non-wetting 
vapour layer (see Supplementary Information discussion on stabiliza- 
tion of vapour layers due to texture, and Supplementary Figs 18, 19). 
Furthermore, the three-dimensional nature of the surface roughness 
permits only small areas of direct contact between the peaks of the 
rough surface and the liquid interface. Contributions to the total heat 
transfer by such small areas are thus limited, and any onset of hetero- 
geneous nucleation of bubbles will simply be taken up in the surface 
vapour layer that in effect will contribute to a smooth transition to the 
Leidenfrost regime. These conclusions are in complete accord with 
what we found when repeating Leidenfrost’s experiments’ of drop
evaporation at heated surfaces. For hydrophobic surfaces we observed 
the classical transition from nucleate to film-boiling mode, whereas at 
superhydrophobic surfaces nucleate boiling is absent at all superheats 
(Supplementary Fig. 16). 

Although it has been demonstrated that the stability of the vapour 
layer in Leidenfrost drop experiments is sensitive to external perturba- 
tions such as vibrations, cooling rate and internal fluid circulation 
within the drops*’, and the hypothesis has been advanced that when 
the fluid drop contact angle at the surface attains the limiting value of 
180°, the film-boiling regime might be possible at all superheat tem- 
peratures’’, this study has demonstrated that with the combination of 
surface roughness and chemical modifications Leidenfrost vapour 
layer stability and the film-boiling behaviour are readily achievable 
and remain robust under general operational conditions. 

With the recent development of a wide variety of laboratory- 
designed and commercially available textured surface coatings with 
superhydrophobic”**™, superamphiphobic*”, anti-frost’ or switchable- 
hydrophobicity’ properties, the effect on thermal exchange shown 
here opens possibilities for new applications of such coatings, ranging 
from the design of efficient heat-exchange devices to technologies for 
aqueous drag reduction. 


METHODS SUMMARY 


In cooling experiments we used 20-mm-diameter stainless-steel spheres 
(FRITSCH GmbH). In heating experiments we used an immersion heater or a 
60-ml stainless-steel cup. The immersion heater is a 24-mm-diameter stainless- 
steel hollow cylinder with an electric heater cartridge fitted inside. All stainless- 
steel surfaces are modified as follows. Smooth hydrophilic surfaces are produced 
by cleaning with organic reagents. Smooth hydrophobic surfaces are produced by 
silanization with trichloro(1H,1H,2H,2H-perfluorooctyl)silane. Textured super- 
hydrophobic surfaces are produced by treatment with a commercial superhydro- 
phobic coating agent (Glaco Mirror Coat ‘Zero’, Soft 99 Co.). Textured 
superhydrophilic surfaces are produced by plasma cleaning of the surfaces previ- 
ously treated with the superhydrophobic agent. A 2-mm hole is drilled into the 
spheres, the immersion-heater cylinder wall or the bottom of the cup vessel to 
allow the insertion of a thermocouple probe to measure the surface temperature. In
cooling experiments, the heated sphere is quenched in a water-filled vessel and the
sphere temperature is recorded as a function of time. In heating experiments, the
cylindrical heater is immersed in a water-filled glass beaker placed on a hot plate to 
control the water temperature, and the heater surface temperature is measured as a 
function of the applied power to the heater cartridge in the cylinder. The heat 
transfer process on the sphere surface or the immersion-heater surface is recorded 
using a high-speed video camera (Photron Fastcam SA-5). In the cup vessel experi- 
ments, the water-filled cup is placed on a hot plate and the vessel surface temper- 
ature is measured as a function of the hot-plate temperature. The same setup is 
used to measure the evaporation time of water droplets placed on the bottom of the 
cup. Further details of methods are available in Supplementary Information. 

Human behaviour is thought to spread through face-to-face social
networks, but it is difficult to identify social influence effects in 
observational studies’-'*, and it is unknown whether online social 
networks operate in the same way'* '”. Here we report results from 
a randomized controlled trial of political mobilization messages 
delivered to 61 million Facebook users during the 2010 US con- 
gressional elections. The results show that the messages directly 
influenced political self-expression, information seeking and real- 
world voting behaviour of millions of people. Furthermore, the 
messages not only influenced the users who received them but also 
the users’ friends, and friends of friends. The effect of social trans- 
mission on real-world voting was greater than the direct effect of 
the messages themselves, and nearly all the transmission occurred 
between ‘close friends’ who were more likely to have a face-to-face 
relationship. These results suggest that strong ties are instrumental 
for spreading both online and real-world behaviour in human 
social networks. 

Recent experimental studies®'*'° have attempted to measure the 
causal effect of social influence online. At the same time, there is 
increasing interest in the ability to use online social networks to study 
and influence real-world behaviour!”~?. However, online social networks 
are also made up of many ‘weak-tie’ relationships” that may not 
facilitate social influence’, and some studies suggest that online 
communication may not be an effective medium for influence”. An 
open question is whether online networks, which harness social 
information from face-to-face networks, can be used effectively to 
increase the likelihood of behaviour change and social contagion. 

One behaviour that has been proposed to spread through networks 
is the act of voting in national elections. Voter turnout is significantly 
correlated among friends, family members and co-workers in obser- 
vational studies****. Voter mobilization efforts are effective at increas- 
ing turnout”, particularly those conducted face-to-face and those that 
appeal to social pressure”® and social identity’. There is also evidence 
from one face-to-face field experiment that voting is ‘contagious’, in 
the sense that mobilization can spread from person to person within 
two-person households”. Although anecdotal accounts suggest that 
online mobilization has made a big difference in recent elections”, a 
meta-analysis of email experiments suggests that online appeals to vote 
are ineffective”. 

Voter mobilization experiments”*** have shown that most methods 
of contacting potential voters have small effects (if any) on turnout 
rates, ranging from 1% to 10%. However, the ability to reach large 
populations online means that even small effects could yield behaviour 
changes for millions of people. Furthermore, as many elections are 
competitive, these changes could affect electoral outcomes. For 
example, in the 2000 US presidential election, George Bush beat Al 
Gore in Florida by 537 votes (less than 0.01% of votes cast in Florida). 
Had Gore won Florida, he would have won the election. 

To test the hypothesis that political behaviour can spread through 
an online social network, we conducted a randomized controlled trial
with all users of at least 18 years of age in the United States who 
accessed the Facebook website on 2 November 2010, the day of the 
US congressional elections. Users were randomly assigned to a ‘social 
message’ group, an ‘informational message’ group or a control group. 
The social message group (n = 60,055,176) was shown a statement at 
the top of their ‘News Feed’. This message encouraged the user to vote,
provided a link to find local polling places, showed a clickable button 
reading ‘I Voted’, showed a counter indicating how many other 
Facebook users had previously reported voting, and displayed up to 
six small randomly selected ‘profile pictures’ of the user’s Facebook 
friends who had already clicked the I Voted button (Fig. 1). The 
informational message group (n = 611,044) was shown the message,
poll information, counter and button, but they were not shown any 
faces of friends. The control group (n = 613,096) did not receive any 
message at the top of their News Feed. 

The design of the experiment allowed us to assess the impact that the 
treatments had on three user actions: clicking the I Voted button,
clicking the polling-place link and voting in the election. Clicking 
the I Voted button is similar to traditional measures of self-reported 
voting, but here users reported their vote to their social community 
rather than to a researcher. We therefore use this action to measure 
political self-expression, as it is likely to be affected by the extent to 
which a user desires to be seen as a voter by others. In contrast, social 
desirability should not affect other user actions in the same way. 
Clicking the polling-place link took users to a separate website that 
helped them to find a polling location, and this action was not reported 
to the user’s social community. We therefore use this action to measure 
a user’s desire to seek information about the election. Finally, we used 
a group-level process to study the validated voting behaviour of 
6.3 million users matched to publicly available voter records (see 
Supplementary Information). 

We first analyse direct effects. We cannot compare the treatment 
groups with the control group to assess the effect of the treatment on 
self-expression and information seeking, because the control group did 
not have the option to click an I Voted button or click on a polling- 
place link. However, we can compare the proportion of users between 
the two treatment groups to estimate the causal effect of seeing the 
faces of friends who have identified themselves as voters (Fig. 1). Users 
who received the social message were 2.08% (s.e.m., 0.05%; t-test, 
P<0.01) more likely to click on the I Voted button than those who 
received the informational message (20.04% in the social message 
group versus 17.96% in the informational message group). Users 
who received the social message were also 0.26% (s.e.m., 0.02%; 
P<0.01) more likely to click the polling-place information link than 
users who received the informational message (Fig. 1). 
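
The reported difference and its standard error can be roughly reproduced from the group sizes and click rates quoted above. The following is a minimal sketch, assuming a simple unpooled standard error for the difference of two independent proportions; the source does not spell out its exact computation, and the function name is illustrative only.

```python
import math

def diff_in_proportions(p1, n1, p2, n2):
    """Return (difference, standard error) for two independent proportions.

    Assumption: unpooled standard error; this may differ from the authors' method.
    """
    diff = p1 - p2
    se = math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
    return diff, se

# Figures from the text: I Voted click rates and group sizes.
diff, se = diff_in_proportions(0.2004, 60_055_176, 0.1796, 611_044)
print(f"difference = {diff:.2%}, s.e. = {se:.2%}")  # difference = 2.08%, s.e. = 0.05%
```

The standard error is dominated by the much smaller informational message group, which is why the very large social message group adds little extra precision.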

Although acts of political self-expression and information seeking 
are important in their own right, they do not necessarily guarantee that 
a particular user will actually vote. As such, we also measured the effect 
that the experimental treatment had on validated voting, through 
examination of public voting records. The results show that users 

Figure 1 | The experiment and direct effects. a, b, Examples of the informational message and social message Facebook treatments (a) and their direct effect on
voting behaviour (b). Vertical lines indicate s.e.m. (they are too small to be seen for the first two bars). 


who received the social message were 0.39% (s.e.m., 0.17%; t-test, 
P = 0.02) more likely to vote than users who received no message at
all. Similarly, the difference in voting between those who received the 
social message and those who received the informational message was 
0.39% (s.e.m., 0.17%; t-test, P = 0.02), suggesting that seeing faces of 
friends significantly contributed to the overall effect of the message on 
real-world voting. In fact, turnout among those who received the 
informational message was identical to turnout among those in the 
control group (treatment effect 0.00%, s.e.m., 0.28%; P = 0.98), which 
raises doubts about the effectiveness of information-only appeals to 
vote in this context. 

These results show that online political mobilization can have a 
direct effect on political self-expression, information seeking and 
real-world voting behaviour, and that messages including cues from 
an individual’s social network are more effective than information- 
only appeals. But what about indirect effects that spread from person 
to person in the social network? Users in our sample had on average 
149 Facebook friends, with whom they share social information, 
although many of these relationships constitute ‘weak ties’. Past
research indicates that close friends have a stronger behavioural effect 
on each other than do acquaintances or strangers”'"'**'. We therefore 
expected mobilization to spread more effectively online through 
‘strong ties’. 

To distinguish users who are likely to have close relationships, we 
used the degree to which Facebook friends interacted with each other on 
the site (see Supplementary Information for more detail). Higher levels 
of interaction indicate that friends are more likely to be physically 
proximate and suggest a higher level of commitment to the friendship, 
more positive affect between the friends, and a desire for the friendship 
to be socially recognized”’. We counted the number of interactions 
between each pair of friends and categorized them by decile, ranking 
them from the lowest to highest percentage of interactions. A validation 
study (see Supplementary Information) shows that friends in the highest 
decile are those most likely to be close friends in real life (Fig. 2a). 
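
The decile categorization described above can be sketched as follows. This is an illustration only: it assumes one interaction count per friendship and a plain rank-based split into ten equal-sized bins, and the function name `assign_deciles` is hypothetical rather than taken from the study.

```python
def assign_deciles(interaction_counts):
    """Map each friendship's interaction count to a decile (1 = lowest, 10 = highest).

    Assumption: ties are broken by input order, which the source does not specify.
    """
    n = len(interaction_counts)
    # Indices of friendships sorted from least to most interaction.
    order = sorted(range(n), key=lambda i: interaction_counts[i])
    deciles = [0] * n
    for rank, idx in enumerate(order):
        deciles[idx] = rank * 10 // n + 1
    return deciles

# Toy data: ten friendships with distinct interaction counts.
counts = [30, 10, 20, 50, 40, 0, 90, 70, 60, 80]
print(assign_deciles(counts))  # [4, 2, 3, 6, 5, 1, 10, 8, 7, 9]
```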

We then used these categories to estimate the effect of the mobil- 
ization message on a user’s friends. Random assignment means that 
any relationship between the message a user receives and a friend’s 
behaviour is not due to shared attributes, as these attributes are not 
correlated with the treatment (see Supplementary Information). To 
measure a per-friend treatment effect, we compared behaviour in the 
friends connected to a user who received the social message to beha- 
viour in the friends connected to a user in the control group. To 
account for dependencies in the network, we simulate the null distri- 
bution using a network permutation method (see the Supplementary 
Information). Monte Carlo simulations suggest that this method 
minimizes the risk of false positives and recovers true causal effects 
without bias (see Supplementary Information). 
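
The idea of a permutation-based null distribution can be illustrated with a small sketch. It is not the authors' procedure, which is detailed in their Supplementary Information. It assumes a toy directed list of (user, friend) pairs, keeps the network and the behaviour fixed, and reshuffles only the treatment labels; all names are hypothetical.

```python
import random

def per_friend_effect(edges, treated, behaviour):
    """Mean behaviour of friends of treated users minus that of friends of control users."""
    t = [behaviour[f] for u, f in edges if treated[u]]
    c = [behaviour[f] for u, f in edges if not treated[u]]
    return sum(t) / len(t) - sum(c) / len(c)

def permutation_null(edges, treated, behaviour, n_perm=1000, seed=0):
    """Approximate 95% interval of the effect under random reassignment of treatment."""
    rng = random.Random(seed)
    users = list(treated)
    labels = [treated[u] for u in users]
    null = []
    for _ in range(n_perm):
        rng.shuffle(labels)  # topology and behaviour stay fixed
        shuffled = dict(zip(users, labels))
        null.append(per_friend_effect(edges, shuffled, behaviour))
    null.sort()
    return null[int(0.025 * n_perm)], null[int(0.975 * n_perm) - 1]

# Toy network: 20 users, each linked to the next two users on a ring.
n = 20
edges = [(u, (u + k) % n) for u in range(n) for k in (1, 2)]
treated = {u: u < 10 for u in range(n)}
behaviour = {u: int(u % 3 == 0) for u in range(n)}

observed = per_friend_effect(edges, treated, behaviour)
low, high = permutation_null(edges, treated, behaviour)
print(observed, (low, high))
```

An observed effect lying outside the interval would be read as unlikely under chance assignment. The sketch assumes every user has at least one outgoing edge, so neither group of friends is ever empty.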

Figure 2 shows that the observed per-friend treatment effects increase
as tie-strength increases. All of the observed treatment effects fall outside 
the null distribution for expressed vote (Fig. 2b), suggesting that they are 
significantly different from chance outcomes. For validated vote 
(Fig. 2c), the observed treatment effect is near zero for weak ties, but 
it spikes upwards and falls outside the null distribution for the top two 
deciles. This suggests that strong ties are important for the spread of 
real-world voting behaviour. Finally, the treatment effect for polling 
place search gradually increases (Fig. 2d), with several of the effects 
falling outside the 95% confidence interval of the null distribution. 

To simplify the analysis and reporting of results, we arbitrarily 
define ‘close friends’ as people who were in the eightieth percentile 
or higher (decile 9) of frequency of interaction among all friendships in 
the sample (see the Supplementary Information). ‘Friends’ are all other 
Facebook friends who had less interaction. A total of 60,491,898 (98%) 
users in our sample had at least 1 close friend, with the average user 
having about 10 close friends (compared with an average of 139 friends 
who were not close). 

The results suggest that users were about 0.011% (95% confidence 
interval (CI) of null distribution -0.009% to 0.010%) more likely to
engage in an act of political self-expression by clicking on the I Voted 
button than they would have been had their friend seen no message. 
Similarly, for each close friend who received the social message, an 
individual was on average 0.099% (null 95% CI -0.042% to 0.048%) 
more likely to express voting. 

We also found an effect in the validated vote sample. For each close 
friend who received the social message, a user was 0.224% (null 95% CI 
-0.181% to 0.174%) more likely to vote than they would have been had 
their close friend received no message. Similarly, for information- 
seeking behaviour we found that for each close friend who received 
the social message, a user was 0.012% (null 95% CI -0.012% to 0.012%) 
more likely to click the link to find their polling place than they would 
have been had their close friends received no message. In both cases 
there was no evidence that other friends had an effect (see 
Supplementary Information). Thus, ordinary Facebook friends may 
affect online expressive behaviour, but they do not seem to affect 
private or real-world political behaviours. In contrast, close friends 
seem to have influenced all three. 

The magnitude of these contagion effects is small per friend, but it
is important to remember that they result from a single message, and in 
many cases it was not possible to change the target’s behaviour. For 
example, users may have already voted by absentee ballot before 
Election Day, or they may have logged in to Facebook too late to vote 
or to influence other users’ voting behaviour. In other words, all effects 
measured here are intent-to-treat effects rather than treatment-on- 
treated effects, which would be greater if we had better information 
about who was eligible to receive the treatment. 

Figure 2 | The effect of mobilization treatment that a friend received on a
user’s behaviour. a-d, A validation study shows that at increasing levels of 
interaction, Facebook friends are more likely to have a close real-world 
relationship (a; see also the Supplementary Information). As the interaction 
increases, so does the observed per-friend effect of friend’s treatment on a user’s 


Moreover, the scale of the number of users, their friendship 
connections and the potential voters in a given election is very large. 
We estimated the per-user effect (the per-friend effect multiplied by 
the average number of friends per user) and the total effect (the 
per-user effect multiplied by the total number of users) on the 
behaviour of everyone in the sample (see Supplementary Informa- 
tion). The results suggest that friends generated an additional 
886,000 expressed votes (+1.4%, null 95% CI -1.1% to 1.1%),
and close friends generated a further 559,000 votes (+0.9%, null 
95% CI -0.3% to 0.3%). In the Supplementary Information we also 
show that close friends of close friends (2 degrees of separation) 
generated an additional 1 million expressed votes (+1.7%, null 95%
CI -0.8% to 0.9%). Thus, the treatment clearly had a significant impact 
on political self-expression and how it spread through the network, 
and even weak ties seem to be relevant to its spread. 

However, the effect of the social message on real-world validated 
vote behaviour and polling-place search was more focused. The results 
suggest that close friends generated an additional 282,000 validated 
votes (+1.8%, null 95% CI -1.3% to 1.2%) and an additional 74,000 
polling-place searches (+0.1%, null 95% CI -0.1% to 0.1%), but there 
is no evidence that ordinary friends had any effect on either of these 
two behaviours. In other words, close friendships accounted for all of 
the significant contagion of these behaviours, in spite of the fact that 
they make up only 7% of all friendships on Facebook. 

To put these results in context, it is important to note that turnout 
has been steadily increasing in recent US midterm elections, from 
36.3% of the voting age population in 2002 to 37.2% in 2006, and to 
expressed voting (b), validated voting (c) and polling-place search (d). Blue
diamonds indicate the observed treatment effect. Horizontal grey bars show the 
null distribution derived from simulations of identical networks in which the 
topology and incidence of the behaviour and treatment are the same but the 
assignments of treatment are randomly reassigned. 


37.8% in 2010. Our results suggest that the Facebook social message 
increased turnout directly by about 60,000 voters and indirectly 
through social contagion by another 280,000 voters, for a total of 
340,000 additional votes. That represents about 0.14% of the voting 
age population of about 236 million in 2010. However, this estimate 
does not include the effect of the treatment on Facebook users who 
were registered to vote but who we could not match because of 
nicknames, typographical errors, and so on. It would be complex to 
estimate the number of users on Facebook who are in the voter record 
but unmatchable, and it is not clear whether treatment effects would be 
of the same magnitude for these individuals, so we restrict our estimate 
to the matched group that we were able to sample and observe. This 
means it is possible that more of the 0.60% growth in turnout between 
2006 and 2010 might have been caused by a single message on 
Facebook. 
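
The arithmetic behind these turnout figures can be checked directly. The sketch below uses only the rounded numbers quoted in the text. The last quantity, the share of the 2006 to 2010 turnout growth, is an illustrative derived value and not a figure reported by the authors.

```python
# Rounded figures quoted in the text.
direct_votes = 60_000
indirect_votes = 280_000
voting_age_population = 236_000_000

total_votes = direct_votes + indirect_votes
share = total_votes / voting_age_population
print(total_votes, f"{share:.2%}")  # 340000 0.14%

# Illustrative only: compare with the turnout growth between 2006 and 2010.
growth_points = 37.8 - 37.2  # percentage points
fraction_of_growth = (share * 100) / growth_points
print(f"{fraction_of_growth:.0%}")  # 24%
```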

The results of this study have many implications. First and foremost, 
online political mobilization works. It induces political self-expression, 
but it also induces information gathering and real, validated voter 
turnout. Although previous research suggested that online messages 
do not work””, it is possible that conventional sample sizes may not 
be large enough to detect the modest effect sizes shown here. We 
also show that social mobilization in online networks is significantly 
more effective than informational mobilization alone. Showing 
familiar faces to users can dramatically improve the effectiveness of 
a mobilization message. 

Beyond the direct effects of online mobilization, we show the 
importance of social influence for effecting behaviour change. Our 
validation study shows that close friends exerted about four times
more influence on the total number of validated voters mobilized than 
the message itself. These results are similar to those from a prior 
network simulation study based on observational data that suggested 
each act of voting on average generates an additional three votes as this 
behaviour spreads through the network (ref. 30). Thus, efforts to influence
behaviour should pay close attention not only to the effect a message 
will have on those who receive it but also to the likelihood that the 
message and the behaviour it spurs will spread from person to person 
through the social network. And, in contrast to the results for close 
friends, we find that Facebook friends have less effect. Online 
mobilization works because it primarily spreads through strong-tie 
networks that probably exist offline but have an online representa- 
tion. In fact, it is plausible that unobserved face-to-face interactions 
account for at least some of the social influence that we observed in this 
experiment. 

More broadly, the results suggest that online messages might 
influence a variety of offline behaviours, and this has implications 
for our understanding of the role of online social media in society. 
Experiments are expensive and have limited external validity, but the 
growing availability of cheap and large-scale online social network 
data’ means that these experiments can be easily conducted in the 
field. If we want to truly understand—and improve—our society, 
wellbeing and the world around us, it will be important to use these 
methods to identify which real world behaviours are amenable to 
online interventions. 

Impact of caloric restriction on health and survival in
rhesus monkeys from the NIA study

Calorie restriction (CR), a reduction of 10-40% in intake of a
nutritious diet, is often reported as the most robust non-genetic 
mechanism to extend lifespan and healthspan. CR is frequently 
used as a tool to understand mechanisms behind ageing and age- 
associated diseases. In addition to and independently of increasing 
lifespan, CR has been reported to delay or prevent the occurrence 
of many chronic diseases in a variety of animals. Beneficial effects 
of CR on outcomes such as immune function’, motor coordina- 
tion’ and resistance to sarcopenia* in rhesus monkeys have recently 
been reported. We report here that a CR regimen implemented in 
young and older age rhesus monkeys at the National Institute on 
Aging (NIA) has not improved survival outcomes. Our findings 
contrast with an ongoing study at the Wisconsin National Primate 
Research Center (WNPRC), which reported improved survival 
associated with 30% CR initiated in adult rhesus monkeys 
(7-14 years)° and a preliminary report with a small number of CR 
monkeys®. Over the years, both NIA and WNPRC have extensively 
documented beneficial health effects of CR in these two apparently 
parallel studies. The implications of the WNPRC findings were 
important as they extended CR findings beyond the laboratory 
rodent and to a long-lived primate. Our study suggests a separation
between health effects, morbidity and mortality. As has been shown in
rodents’, study design, husbandry and diet composition may strongly
affect the life-prolonging effect of CR in a long-lived nonhuman
primate.

For over 20 years, the NIA has studied the effects of CR in long-lived 
nonhuman primates (NHPs) (Macaca mulatta, average lifespan in 
captivity is ~27 years and maximum reported lifespan is ~40 years), 
to verify whether the life-prolonging effects observed in lower organisms 
also occur in monkeys and thus, might plausibly translate to human 
ageing'*'’. The NIA CR study began in 1987 at the NIH Animal 
Center’*. CR was initiated in monkeys of varying ages to evaluate the 
impact of age of onset of CR on its biological effects. Study design has 
been reported elsewhere'*"*. Male and female monkeys were enrolled 
into the study at young, middle and older ages’*. Data reported here 
are grouped as either young-onset (includes juvenile, adolescent and 
adult) or old-onset monkeys. Supplementary Table 1 reports the 
current census. 

Any animal that died underwent a complete necropsy by a board- 
certified pathologist. A gross description of the pathology related to 
each organ was provided along with the probable cause of death and 
any contributing factors. Survival data were analysed in two ways:
all-cause mortality and age-related deaths; a distinction also reported
previously’. In both studies (NIA and WNPRC), age-related survival 
excluded deaths due to acute conditions that do not have an age- 
related increase in risk such as gastrointestinal bloat, anaesthesia, 
injury or endometriosis. Pathology details are in the Supplementary 
Information. 

Old-onset CR monkeys (16-23 years) did not live longer than con- 
trols in either the all-cause (Fig. 1a) or age-related survival analysis 
(there were three non-age-related deaths in the CR group and two
in the control group, graph not shown). In this group, males had 
significantly longer survival compared to females (P = 0.0003) and 
neither sex benefitted from CR. To date, four CR monkeys and one 
control from the old-onset group have lived beyond 40 years. Although 
CR has not increased mean or maximum lifespan relative to control, 
50% survival is 27.8 years for the females and 35.4 years for the males,
exceeding the ~27-year median lifespan previously reported for
monkeys in captivity'*. These monkeys may have benefitted from 
excellent husbandry conditions and thus CR started at older ages 
provided no additional increase in survival. Furthermore, there were 
no apparent differences in causes of death between the two diet groups. 
Neoplasia, cardiovascular disease, amyloidosis and general organism 
deterioration in the oldest animals were equally represented in both 
diet groups. 

Old-onset CR was beneficial on several measures of metabolic 
health and overall function. Both male and female CR monkeys 
weighed less than the control counterparts, although the diet effect 
was greater in the males. In longitudinal measures from serum of 
fasted monkeys, triglycerides, cholesterol and glucose levels increased 
with age for both male and female controls. However, triglycerides 
were significantly lower in the CR monkeys (F(1,21) = 5.76,
P = 0.026) (Fig. 1b), and cholesterol remained significantly lower in
the CR males (Fig. 1c) (F(40,774) = 1.53, P = 0.02). At the oldest ages,
fasting glucose was numerically lower in the CR monkeys (Fig. 1d) and
significantly lower in CR males compared to controls (P = 0.04). On a
single measure of plasma-free isoprostane, an indicator of oxidative
stress, control males had significantly higher levels than the CR
monkeys (23.24 ± 1.25 versus 15.93 ± 1.97 pg ml⁻¹; P = 0.009). In
contrast, we previously reported that old-onset CR may negatively 
affect immune function”’. 

Current survival curves for the young-onset male and females are 
shown in Fig. 2a (all-cause mortality) and Fig. 2b (age-related mortality). 
No significant diet effects are noted in survival between control and CR 

Figure 1 | Survival curve and triglycerides, cholesterol and glucose levels for
old-onset monkeys. a, Kaplan-Meier survival curve for all-cause mortality for 
old-onset monkeys. All-cause mortality was analysed using Cox regression with 
age of onset, sex and diet as predictors. The effect of diet was not significant 
(P = 0.934) and sex was the only significant predictor (P = 0.003). Open circles 
represent monkeys that are still alive. b, Fasting serum triglycerides (mg dl⁻¹) predicted
from age-dependent individual-specific trajectories for old-onset monkeys.
Triglyceride levels increased with age (F(16,162) = 2.12, P = 0.0096) and CR
monkeys had significantly lower levels than control (F(1,21) = 5.76, P = 0.026).
F, female; M, male. Overall triglyceride trajectories were based on 243 
observations for 34 monkeys (50 observations for 8 control-F; 81 for 10 control- 
M; 32 for 7 CR-F; 80 for 9 CR-M). Age breakdowns for all figures are in the 
supplementary material. c, Cholesterol predicted from age-dependent 
individual-specific trajectories for old-onset monkeys. Cholesterol levels 
increased with age (F(s3,774) = 1.54, P = 0.009), and male monkeys had 
significantly lower levels than females (F(1,24) = 23.60, P < 0.0001). A
significant three-way diet-sex—age interaction (F(40,774) = 1.53, P = 0.02) 
indicated that cholesterol levels increased with age for control males whereas 
CR males tended to have a slight reduction in cholesterol. Thus, at older ages 
(>30 years), CR male monkeys have significantly lower cholesterol levels 
compared to controls. Overall cholesterol trajectories were based on 994 
observations for 28 animals (204 for 7 control-F; 301 for 7 control-M; 134 for 5 
CR-F; and 355 for 9 CR-M). d, Fasting serum glucose (mg dl⁻¹) levels
predicted from age-dependent individual-specific trajectories for old-onset
monkeys. Five glucose measurements above 100 mg dl⁻¹ for one diabetic
control-M were omitted to remove the influence of these outliers on the
analyses and graphs. There were significant changes in glucose over time
(F(20,285) = 10.48, P < 0.0001) and males and females were significantly
different in the trends over time (F(1g,285) = 3.58, P < 0.0001) with males having 
increases in glucose levels over time, whereas the glucose levels of the females 
slightly decreased. The overall CR difference was not significant (F(1,22) = 1.18,
P = 0.288), and the CR differences in trend over time were not significant
(F(20,285) = 1.23, P = 0.2259). Additional analyses stratified by sex
showed that control males had significantly higher glucose levels compared to
CR males (F(1,14) = 5.27, P = 0.04). Overall glucose trajectories were based on
387 observations for 34 monkeys (79 observations for 8 control-F; 131 for 10 
control-M; 48 for 7 CR-F; 129 for 9 CR-M). 


monkeys for either analysis. Statistical controls are described in 
Methods. Of the original 86 monkeys in the young-onset cohorts, 
24% (11/46) of the control animals and 20% (8/40) of the CR group 
died of age-related causes. The NIA findings contrast with the adult- 
onset study at WNPRC that demonstrated a beneficial CR effect in 
which 37% of the control monkeys had died from age-related causes 
compared to only 13% in the CR group. When accounting for all 
deaths, the trend persisted with 9 control and 13 CR animals dying 
of non-age-related causes. Survival probabilities for all NIA age groups
combined are shown in Supplementary Fig. 1a, b. 

Considering that just less than 50% of young monkeys are still alive, 
these data do not represent final lifespan curves in this study. On 

Figure 2 | Survival curves and glucose and triglycerides levels for young-
onset monkeys. a, b, Kaplan-Meier survival curves for all-cause (a) and age- 
related mortality (b) for young-onset monkeys. Both were analysed using Cox 
regression with age of onset, origin, sex and diet (P = 0.255 and P = 0.975, 
respectively) as predictors with none of these factors being statistically 
significant. Open circles represent monkeys that are still alive and
non-age-related deaths in b. c, Fasting serum glucose (mg dl⁻¹) levels predicted from
age-dependent individual-specific trajectories for young-onset monkeys. Fourteen
glucose measurements above 100 mg dl⁻¹ in diabetic monkeys (7 observations
for 3 control-M; 5 for 2 CR-M; 2 for 1 control-F) were omitted to remove the 
influence of these outliers on the analyses and graphs. There were significant 
changes in glucose over time (F(ig,1,112) = 11.24, P< 0.0001), and males and 
females were significantly different in the trends over time (F(1g.1,112) = 1.98, 
P= 0.0088) with males having a larger increase in glucose levels over time. 
There was no significant difference due to diet group. Overall glucose 
trajectories were based on 1,260 observations for 81 monkeys (346 observations 
for 23 control-F; 350 for 20 control-M; 281 for 20 CR-F; 283 for 18 CR-M). 
d, Fasting serum triglycerides (mg dl⁻¹) predicted from age-dependent
individual-specific trajectories for young-onset monkeys. There were 
significant changes in triglycerides over time (F(14,g43) = 17.59, P< 0.0001) and 
males and females were significantly different in the trends over time 
(Fo.4g43) = 5.36, P< 0.0001). Furthermore, there was a diet-sex interaction 
indicating that the overall effect of CR on triglycerides was significantly 
different for male and female monkeys (Fi; 6g) = 5.07, P = 0.0276). Specifically, 
CR males had lower triglycerides than control males. By contrast, CR females 
had higher triglyceride levels than control females. Overall triglyceride 
trajectories were based on 973 observations for 81 monkeys (266 observations 
for 23 control-F; 280 for 20 control-M; 213 for 20 CR-F; 214 for 18 CR-M). 


the basis of lifespan projections using the hazard function’®,
most animals are projected to be dead 10 years from now, and the
estimated probability that the overall survival outcome will favour
the CR group is less than 0.1%.
The probability that a significantly different effect on mean survival will 
emerge in the next 5-10 years of the study is very low; however, a 
potential effect on maximum lifespan cannot be ruled out. 

As there is a clear difference in CR effect on mortality between the 
colonies at NIA and WNPRC, further comparisons of these two 
longitudinal studies are warranted and planned. In a comparison of
NIA’s current data (as of 1 December 2011) with the published WNPRC
data summarized as of 22 February 2008 and reported in ref. 5, NIA
monkeys, both control and CR, may have a lifespan advantage
comparable to that of the WNPRC CR monkeys.

Although they eat less (Supplementary Table 2) and weigh less’, 
young-onset CR monkeys lack many of the expected CR benefits. 
Fasting serum glucose levels were not significantly lower in the CR 
monkeys compared to control (Fig. 2c), and only the CR males 
had somewhat lower triglycerides compared to respective controls 
(P = 0.051) (Fig. 2d). However, in a ligature-induced model of


13 SEPTEMBER 2012 | VOL 489 | NATURE | 319 


©2012 Macmillan Publishers Limited. All rights reserved 


inflammation in the oral cavity’, we have shown an improved immune 
response in young-onset CR monkeys and beneficial effects in T cells 
isolated from adolescent-onset males’. 

The incidence of cancer was markedly reduced in young-onset CR
monkeys (P = 0.028 compared to controls); in fact, neoplasia has not
been identified in any monkey from this group (Fig. 3a). In contrast, 
five of the six cases in young-onset control monkeys were considered 
the cause of death with a mean age at diagnosis of 22.8 + 1.7 years. 
Glucoregulatory function was also improved in CR monkeys (Fig. 3a). 
However, two cases of diabetes have been diagnosed in CR monkeys; 
thus, the prevention of obesity did not prevent the occurrence of 
insulin-dependent diabetes and further investigation of the aetiology 
of such cases is of interest. Interestingly, CR did not reduce the incid- 
ence of cardiovascular disease as was reported in the WNPRC colony. 
Our findings are based on tissue pathology because these diagnoses 
were identified after death. 

An analysis of first occurrence of age-related disease was done on 
the NIA monkeys using the same disease criteria as defined by the 
WNPRC study. These conditions included: cancer, diabetes, arthritis,
diverticulosis and cardiovascular disease. Although age-related diseases 

Figure 3 | Incidence and estimated proportions of age-related diseases.
a, Incidence of three major age-related conditions. Age at diagnosis is
represented with a ‘Y’ for young-onset monkeys and ‘O’ for old-onset monkeys. 
Animals may be represented more than once if multiple conditions existed. 
b, Estimated proportions for the first occurrence of any age-related disease in 
each monkey from the young-onset age group (males and females combined) 
statistically controlling for sex and sex—CR interaction. These conditions 
included: cancer, diabetes, arthritis, diverticulosis and cardiovascular disease. 
The difference between control and CR is not statistically significant, P = 0.06. 
Old-onset monkeys are not represented. 

were detected in control monkeys at an earlier age than in CR monkeys,
the incidence curves were not significantly different (P = 0.06) (Fig. 3b).

Considering that these two projects maintain high quality veterinary 
support in comparable experimental settings, study the same species of 
primates, and test the same intervention, what could account for the 
differences in survival outcome? 

A notable difference between the two studies is diet composition. 
The NIA-1-87 formulation (Labdiet, PMI Nutrition International) has 
a natural ingredient base whereas WNPRC diet is purified (Harlan 
Teklad). Although natural ingredient diets risk having some variation 
between batches, they contain components that may have an impact 
on health such as phytochemicals, ultra-trace minerals and other 
unidentified elements’’. In purified diets, each ingredient supplies a 
specific nutrient and each required mineral and vitamin is added as a 
separate component. Nutrient sources were also different. Protein was 
derived from wheat, corn, soybean, fish and alfalfa meal for the NIA 
diet, whereas the WNPRC diet protein source was lactalbumin. The 
NIA diet also contained flavonoids, known for their antioxidant 
activity, and fat from soy oil and the oils from the other natural ingre- 
dients (that is, corn, wheat and fish). Fish meal contains approximately 
8-12% fat and is rich in omega-3 fatty acids. The WNPRC study 
dietary fat was derived from corn oil. Carbohydrate content was also 
notably different; although both diets had 57-61% carbohydrate by 
weight, the NIA study diet was composed primarily of ground wheat
and corn, whereas the WNPRC study diet contained corn starch and 
sucrose. Indeed, the WNPRC diet was 28.5% sucrose, whereas the NIA 
study diet was only 3.9% sucrose. This latter point may be particularly 
important as a diet high in sucrose may contribute to the incidence of 
type II diabetes'*”. 

The NIA and WNPRC studies also approached vitamin and mineral
supplementation differently. The NIA study used one diet for both CR
and control monkeys, which was supplemented with an additional
40% of the daily-recommended allowance to ensure adequate nutrition
for the CR monkeys. Thus, the NIA diet formulation super- 
supplemented the control monkeys. The WNPRC study fed two dif- 
ferent diets and only the CR monkeys were supplemented. 

Another important difference in study design was that the NIA
study control monkeys were not truly fed ad libitum, unlike those in the
WNPRC study. The regulated portioning of food for the NIA control
monkeys may be a slight restriction that largely prevented obesity.
It has been reported that 10% CR increased lifespan in rats compared to
ad libitum, even more than 25 and 40% CR”. The NIA control
monkeys may experience survival benefits from this slight restriction.

Calorie restriction effectively lowered body weight in the NIA and 
WNPRC monkeys (Supplementary Table 2)'*?'. However, WNPRC 
monkeys generally weighed more than corresponding NIA monkeys. 
For example, at 17 years of age, WNPRC males weighed approximately 
12% more than corresponding NIA males and the difference was 
approximately 18% for the females. Thus, the NIA monkeys may be 
in an optimal weight range. 

NIA monkeys originated from both China and India, and have 
greater genetic diversity compared to the strictly Indian colony at 
WNPRC. In rodent studies, genetic differences have affected the
survival outcome in CR studies, with CR even shortening lifespan in some recombinant
inbred mouse strains”. In genetically heterogeneous wild-caught mice,
although hormonal and weight loss effects were consistent with CR, 
there was no overall mean longevity effect”*. It is apparent that the effect 
of CR is not straightforward, and genetic differences may have a larger 
role than has been considered to date. A final analysis which includes all 
monkeys and controls for genetic origin can address this confounding 
variable. 

Lastly, as in rodent studies”, the age of onset of the CR regimen 
for the two studies could certainly impact survival outcome as it has 
other measures. CR initiated in the youngest male monkeys delayed 
maturation” and slowed skeletal growth”’. Additionally, only the 
immune response of the adolescent males was improved by CR”. 

In the first randomized trial in humans, 6 months of CR improved
several biomarkers of ageing and improved cardiovascular health, 
suggesting a reduction in risk of age-related disease. However, a 
lifespan study in humans is improbable’’. Even a self-imposed CR 
regimen in lean individuals improved several metabolic, inflammatory 
and cardiovascular measures”. Current findings show that in 
nonhuman and human primates, CR evokes very similar metabolic, 
hormonal and physiological changes that are linked to longevity in CR 
rodents”. It will be valuable to continue to compare findings from 
ongoing monkey CR studies to dissect the mechanisms behind the 
improvement in health that occurred with and without significant 
effects on survival. 


METHODS SUMMARY 

Approval. This study was approved by the Animal Care and Use Committee of the 
NIA, NIH and was conducted in an AAALAC-accredited facility.

Diagnostics. In live animals, diagnostic evaluations were made on the basis of 
clinical presentation. Radiographs confirmed conditions of osteoarthritis; 
endoscopic evaluation of diverticulosis revealed hernia-like outpouching in the 
mucosa of the descending colon with trapped faecal material in the diverticula; 
diabetes was confirmed by consistent elevated fasting glucose and glucose 
response during an intravenous glucose tolerance test; surgical biopsy or removal 
of tumours confirmed neoplasia. Cardiovascular abnormalities such as myofibre 
loss and fibrosis were diagnosed at necropsy as well as death due to acute con- 
gestive heart failure. 

Blood sampling. For longitudinal measures, blood samples were obtained under 
ketamine (7-10 mg kg⁻¹, intramuscular) or Telazol (3.5 mg kg⁻¹, intramuscular)
anaesthesia following an overnight fast. Serum samples were stored at -80 °C
until analysed. Plasma free isoprostane samples were collected in 2005 and 
measured according to the description in ref. 29. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS


Animals. With the exception of six old-onset males, all monkeys had known 
birthdates. Estimated ages were assigned to these six based on dental archives 
and historical records. No monkey had been used in invasive experiments before 
procurement. After procurement, monkeys were initiated on the study after 
required quarantine. Food intake was considered ad libitum during this time. 
Husbandry has been described previously'*. NIA monkeys were fed a natural 
ingredient diet containing 56.9% carbohydrate, 17.3% protein and 5% fat. 
Statistical methods. A Fisher’s exact test was used to compare the incidence of 
neoplasia in the young-onset cohort. Analyses of age-associated diseases and 
mortality included all animals with known diagnoses or cause of death before 1 
December 2011. 
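
As a rough illustration of the Fisher's exact test named above, the following minimal sketch computes a two-sided P value for a 2x2 table using only the Python standard library. The counts are assumptions pieced together from the main text (six neoplasia cases among the 46 young-onset controls, none among the 40 young-onset CR monkeys); the function name and table layout are illustrative and are not the authors' SAS code.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x events in row 1 with fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lowest = max(0, col1 - row2)
    highest = min(row1, col1)
    p_observed = prob(a)
    # Sum every table that is no more probable than the observed one.
    return sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-9)
    )

# Assumed counts. Rows: control, CR. Columns: neoplasia, no neoplasia.
p_value = fisher_exact_two_sided(6, 40, 0, 40)
print(round(p_value, 3))
```

Under these assumed counts the result is close to the P = 0.028 reported in the main text for cancer incidence in the young-onset cohort.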

Twenty of the 26 adult-onset females were obtained from a military 
research facility, and 19 of these monkeys developed severe and rapidly progres- 
sing endometriosis. The twentieth monkey of this group died at the age of 12 years 
from renal necrosis. It seemed apparent that this cohort was differentially affected 
in terms of long-term health, and thus, an indicator variable that designated the 
source of this monkey group as ‘Aberdeen’ was created and was included in most 
analyses to control statistically for the effects of these animals on the outcomes of 
interest. 

To determine the effect of CR on the onset of age-associated diseases (morbidity)
and mortality, Cox proportional hazard’* regressions with sex, caloric restriction
(CR), a sex-CR interaction term and a covariate to adjust for whether the
animal was obtained from the Aberdeen site as predictors were used to estimate the
survival and hazard functions. The proportional hazards (PH) assumption was
tested by fitting a non-PH Cox regression with a CR-time interaction, which was 
not significant for either analysis, and thus, PH models were considered valid. 
Animals that died of non-age-related causes (for example, death from anaesthesia, 
gastrointestinal bloat) were censored in both the mortality and morbidity analyses. 
Their age at death was used as the time variable in the Cox regressions. For the 
morbidity analysis, the age at which the animal experienced its first age-related 
diagnosis was used as the time variable in the Cox regression. Animals that received 
a non-age-related diagnosis were censored and their current age was used as the 
time variable. Animals that died of an age-related cause without ever receiving an
age-related diagnosis were not censored: death was considered their first
age-related diagnosis and their age at death was used as the time variable.
The designation of ‘age-related’ was based on the same rationale and list of
conditions as reported by WNPRC. All analyses were performed in SAS PROC PHREG and likelihood
ratio tests were computed to assess statistical significance.
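
The censoring rules described above can be made concrete with a small sketch of the Kaplan-Meier product-limit estimator, the method behind the survival curves in Figs 1a and 2a, b. This is not the Cox regression fitted in SAS PROC PHREG; it is a simpler illustration of how a censored animal leaves the risk set without counting as a death. The ages and event flags are hypothetical and are not data from the study.

```python
def kaplan_meier(times, events):
    """Return [(time, survival)] at each distinct event time.

    times  -- age at death or at last observation
    events -- 1 if the death counts as an event, 0 if the animal is censored
    """
    pairs = sorted(zip(times, events))
    at_risk = len(pairs)
    survival = 1.0
    curve = []
    i = 0
    while i < len(pairs):
        t = pairs[i][0]
        deaths = 0
        removed = 0
        # Collect every animal whose observation ends at age t.
        while i < len(pairs) and pairs[i][0] == t:
            deaths += pairs[i][1]
            removed += 1
            i += 1
        if deaths:
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        # Censored animals leave the risk set without lowering survival.
        at_risk -= removed
    return curve

# Hypothetical ages (years) and flags; 0 marks a non-age-related death
# or an animal that is still alive, both treated as censored.
ages = [18, 22, 25, 25, 30, 34]
age_related_death = [1, 0, 1, 1, 0, 1]
curve = kaplan_meier(ages, age_related_death)
for age, s in curve:
    print(age, round(s, 3))
```

Switching an animal's flag from 1 to 0 shows the effect of the age-related versus all-cause distinction: the animal still shrinks the risk set at its age of death, but the curve does not step down there.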

A linear mixed model” approach was used to estimate longitudinal trends in the 
data while accounting for the dependency in the data due to multiple observations 
per subject. SAS PROC MIXED was used to estimate the trends and group
differences among the repeatedly measured outcomes (for example, body weight,
glucose, cholesterol and triglycerides) across the years of measurement. The effects
of CR on overall outcome levels and differences in longitudinal trends were tested 
by including diet main effect and diet-year interaction terms in the model. Male
and female monkeys were analysed together and sex main effects and sex-diet and
sex-year interactions were also included in the models. The young- and old-onset
groups were analysed separately. Age at the first measurement (that is, starting 
age) was used as a covariate to control for differences in age among the animals 
within a given year of measurement and a lag-1 autoregressive process over time 
was assumed. For the young-onset group a covariate to adjust for whether they 
were obtained from the Aberdeen site was added. Outliers were screened and 
removed. Specifically, a few young animals had glucose levels substantially above
200 mg dl⁻¹. Also, one old control male that was eventually diagnosed with
diabetes had extremely high triglyceride levels ranging from 342 to 1,314 mg dl⁻¹,
and these values influenced the significance of some effects. Briefly, the linear
mixed model approach estimates a growth trajectory for each individual animal 
(for example, individual change in weight over time), adjusting for covariates. 
Then a weighted composite of these individual trajectories is computed to show 
the average trend over the age of the animals in a particular group (for example, 
average weight of animals at varying ages for CR-M). The weights for these 
composites are based on the number of observations each animal contributes to 
the data. For example, animals that live longer will contribute more data, and 
therefore will get larger weights. To smooth the trends for plotting graphs, the
predicted values from each individual trajectory were averaged, and loess trend
lines were constructed.
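
The weighting step described above can be sketched as an observation-weighted mean of individual predictions at a single age. This covers only the compositing arithmetic and not the mixed model fitted in SAS PROC MIXED; the animal labels, predicted weights and observation counts are hypothetical.

```python
def weighted_composite(predicted, n_obs):
    """Observation-weighted mean of individual predicted values at one age."""
    total = sum(n_obs[animal] for animal in predicted)
    return sum(predicted[animal] * n_obs[animal] for animal in predicted) / total

# Hypothetical predicted body weights (kg) at one age for three CR males,
# and the number of observations each animal contributed to the data.
predicted_weight = {"M1": 10.0, "M2": 12.0, "M3": 11.0}
observations = {"M1": 10, "M2": 30, "M3": 20}

composite = weighted_composite(predicted_weight, observations)
unweighted = sum(predicted_weight.values()) / len(predicted_weight)
print(round(composite, 2), round(unweighted, 2))
```

Because animal M2 contributes the most observations, the composite is pulled towards its prediction and sits above the unweighted mean, which mirrors the statement that longer-lived animals receive larger weights.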

Competing risk. The analyses in this paper as well as in ref. 5 distinguished 
between age-related and all-cause mortality. To address the possibility that
non-age-related deaths are associated with CR, competing risks Cox proportional
hazard regression models’* were fitted separately for the young-onset group
(9 control and 13 CR non-age-related deaths) and the old-onset group (2 control and 3 CR
non-age-related deaths). Briefly, a competing risks model treats
age-related and non-age-related deaths as mutually exclusive events, each compared to
neither event occurring (that is, animals still alive are censored). These events have
competing risks in that if an animal dies from a non-age-related cause it is no
longer at risk for an age-related death (and vice versa). For the old-onset animals,
age at start of the experiment was not significantly related to non-age- (P = 0.188) 
or age-related deaths (P = 0.269). CR was not significantly related to non-age- 
(P = 0.260) or age-related deaths (P = 0.490). Also, sex was not significantly 
related to non-age- (P = 0.991) or age-related deaths (P = 0.053); however, this 
association of sex with age-related mortality is of marginal significance, which is
consistent with the trend for males to have higher survival curves (see Figs 1a and
2a, b). For the young-onset animals, age at start of the experiment was not sig- 
nificantly related to non-age- (P = 0.604) or age-related deaths (P = 0.653). Sex 
was not significantly related to non-age- (P= 0.790) or age-related deaths 
(P = 0.480). CR was not significantly related to non-age- (P = 0.147) or age- 
related deaths (P = 0.975). Also, the origin (Aberdeen) of the animal was not 
significantly related to age-related deaths (P = 0.513), and the relationship to
non-age-related deaths was not statistically significant (P = 0.0889). This marginal
P value could suggest that origin may be a confounding factor.
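
One common way to set up such an analysis is cause-specific coding, in which each death type gets its own event indicator and every other outcome is treated as censored for that model. The sketch below shows only this recoding step. Whether the authors' SAS analysis used exactly this coding is an assumption, and the outcome labels and data are hypothetical.

```python
def cause_specific_indicators(causes, cause_of_interest):
    """Return 1 for deaths from the cause of interest, 0 (censored) otherwise."""
    return [1 if cause == cause_of_interest else 0 for cause in causes]

# Hypothetical outcomes for six animals; None means the animal is still alive.
outcomes = ["age", "non_age", None, "age", "non_age", None]

# One indicator vector per competing event; each would feed a separate model.
age_events = cause_specific_indicators(outcomes, "age")
non_age_events = cause_specific_indicators(outcomes, "non_age")
print(age_events)
print(non_age_events)
```

No animal is flagged in both vectors, which reflects the mutual exclusivity of the two death types, and animals that are still alive are censored in both.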


30. Littell, R. C., Milliken, G. A., Stroup, W. W., Wolfinger, R. D. & Schabenberger, O. SAS
for Mixed Models (SAS Institute Inc., 2006). 

Increased proteasome activity in human embryonic

Embryonic stem cells can replicate continuously in the absence of
senescence and, therefore, are immortal in culture’”. Although 
genome stability is essential for the survival of stem cells, proteome 
stability may have an equally important role in stem-cell identity 
and function. Furthermore, with the asymmetric divisions invoked 
by stem cells, the passage of damaged proteins to daughter cells 
could potentially destroy the resulting lineage of cells. Therefore, a 
firm understanding of how stem cells maintain their proteome is of 
central importance. Here we show that human embryonic stem 
cells (hESCs) exhibit high proteasome activity that is correlated 
with increased levels of the 19S proteasome subunit PSMD11 
(known as RPN-6 in Caenorhabditis elegans)** and a corresponding 
increased assembly of the 26S/30S proteasome. Ectopic expression of 
PSMD11 is sufficient to increase proteasome assembly and activity. 
FOXO4, an insulin/insulin-like growth factor-I (IGF-I) responsive
transcription factor associated with long lifespan in invertebrates®’, 
regulates proteasome activity by modulating the expression of 
PSMD11 in hESCs. Proteasome inhibition in hESCs affects the 
expression of pluripotency markers and the levels of specific markers 
of the distinct germ layers. Our results suggest a new regulation of 
proteostasis in hESCs that links longevity and stress resistance in 
invertebrates to hESC function and identity. 

Embryonic stem cells are unique among all stem-cell populations 
examined in that they do not seem to undergo replicative senescence’”. 
Because proteostasis is crucial for maintaining proper cell function*”, 
hESCs could provide a new model to define proteostasis regulation and 
its demise in ageing. Central to proteostasis is the ubiquitin proteasome 
system and we examined whether proteasome activity changes as 
hESCs differentiate into several cell lineages. To evaluate differences 
in the 26S/30S proteasome activity, we monitored the degradation of 
specific fluorogenic peptide substrates'®. We differentiated H9 hESCs 
into neural progenitor cells (NPCs) and observed a marked decrease
in the chymotrypsin-like proteasome activity (Fig. 1a). Moreover,
when NPCs were differentiated into neurons, we detected a further
decrease in proteasome activity during the differentiation process that
was observable after 2 weeks (Fig. 1a and Supplementary Fig. 1).
Consistent with enhanced proteasome activity in hESCs, we found 
increased levels of polyubiquitinated proteins in differentiated cells 
compared with hESCs (Fig. 1b). Because hESCs are known to vary in 
their characteristics despite unlimited capacity of self-renewal", we 
differentiated a distinct hESC line, HUES-6 cells, and obtained similar 
results (Supplementary Figs 1 and 2). Proteasome inhibitors blocked 
activity from extracts of hESCs, NPCs and neurons (Supplementary 
Fig. 3), indicating that the increased peptidase activity was indeed 
due to the proteasome. In addition, the other two activities of the 
proteasome, the caspase-like and trypsin-like activities, were also 
increased in hESCs (Supplementary Fig. 4). Proteasome activity did 
not differ depending on the passage number (Supplementary Fig. 5). 


& Andrew Dillin! 


The decrease in proteasome activity was not a specific phenomenon 
associated with the neural lineage as differentiation into either 
trophoblasts or fibroblasts induced a similar decrease (Fig. 1c, d). 
Notably, hESCs lost their high proteasome activity in a continuous 
progressive manner during the differentiation process (Fig. 1c). 
Moreover, we examined other cell lines extracted from human tissues, 
such as astrocytes, BJ fibroblasts or immortalized HEK293T cells, and 
found that these cells also had lower proteasome activity compared with 
hESCs (Supplementary Fig. 6). We tested whether high proteasome 
activity in hESCs was associated with increased proliferation and found 
Figure 1 | Increased proteasome activity in hESCs. a, Chymotrypsin-like 
(Z-Gly-Gly-Leu-AMC) proteasome activity (relative slope to H9 hESCs) is shown. 
hESCs (n = 11), NPCs (n = 13), neurons (n = 10); *P < 0.00001. 

b, Representative immunoblot of polyubiquitinated protein levels. β-actin is the 
loading control. Total protein was visualized by Coomassie staining. 

c, Chymotrypsin-like proteasome activity (relative slope to H9 hESCs) is 
shown. hESCs (n = 6), 2 days of differentiation into trophoblasts (n = 6), 

5 days (n = 6), 8 days (n = 7). hESCs versus 2 days, *P < 0.001; hESCs versus 5 
days, *P = 7.7 X 10 ’; hESCs versus 8 days, *P < 0.0001. d, Chymotrypsin- 
like proteasome activity (relative slope to H9 hESCs) is shown. hESCs (n = 3), 
fibroblasts (n = 3); *P < 0.001. e, Chymotrypsin-like proteasome activity 
(relative slope to BJ fibroblasts) is shown. BJ fibroblasts (n = 10), iPSC line 1 
(n = 10), iPSC line 2 (n = 9), H9 hESCs (n = 5). iPSC lines have increased 
proteasome activity compared with fibroblasts (*P < 0.0005), and no 
significant differences compared with H9 hESCs (iPSC line 1 versus hESCs, 
P = 0.11; iPSC line 2 versus hESCs, P = 0.29). In a, c, d and e, data represent the 
mean ± s.e.m. All statistical comparisons were made by Student’s t-test for 
unpaired samples. 
that hESCs and HEK293T cells had nearly identical proliferation rates, 
yet hESCs had higher proteasome activity (Supplementary Fig. 7). 
Induced pluripotent stem cells (iPSCs) can be derived from adult somatic 
cells by forced expression of exogenous factors that promote cell 
reprogramming'**. iPSC lines are similar to embryonic stem cells in 
many respects, such as their gene expression patterns, proteome profile 
and potential for differentiation’*’’. However, the full extent of their 
similarity to embryonic stem cells is still being assessed’°. We analysed 
two iPSC lines that had been carefully validated to ensure similar 
gene expression profiles, growth characteristics and developmental 
potentials to hESCs'’. We discovered that these iPSC lines derived 
from BJ fibroblasts display increased proteasome activity similar to 
hESCs (Fig. 1e), indicating that proteasomal activity can indeed be 
reprogrammed. 

The 26S/30S proteasome consists of a 20S core structure containing 
the proteolytic active sites and 19S cap structures that impart regu- 
lation on the activity of the holo-complex (26S, single and 30S, double 
capped)'®. Although 20S particles can exist in a free form, 20S particles 
Figure 2 | Increased proteasome assembly in hESCs depends on PSMD11 
expression. a, Chymotrypsin-like proteasome activity (relative slope to H9 
hESCs + 0.025% SDS) is shown. n = 4-6; *P < 0.01. No significant differences 
(N.S.) were found among the different cells when SDS was added (hESCs + 
SDS versus NPCs + SDS, P = 0.25; hESCs + SDS versus neurons + SDS, 
P = 0.09). b, Relative PSMD11 expression compared with H9 hESCs. hESCs 
(n = 10), NPCs (n = 6), neurons (n = 8); *P < 0.00001. c, e, Western blot 
analysis with antibodies against PSMA1-7, PSMD11 and PSMD1. β-actin is the 
loading control. d, Relative PSMD11 expression compared with BJ fibroblasts. 
BJ fibroblasts (n = 10), iPSC line 1 (n = 6), iPSC line 2 (n = 6); *P < 0.00001. 
f, Proteasome activity (relative slope to lentivirus non-targeting shRNA) is 
shown. Non-targeting shRNA (n = 10), PSMD11 shRNA 1 (n = 8), PSMD11 
shRNA 2 (n = 6), PSMC2 shRNA 1 (n = 6), PSMC2 shRNA 2 (n = 3); 

*P < 0.01. g, Native gel electrophoresis followed by western blot with PSMA1- 
7 (20S subunit) or PSMD2 (19S subunit) antibodies. h, Chymotrypsin-like 
proteasome activity (relative slope to green fluorescent protein (GFP) 
overexpressing (OE) HEK293T cells) is shown. GFP(OE) (n = 4), 
PSMD11(OE) (n = 5); *P < 0.005. i, j, Native gel electrophoresis of HEK293T 
cell extracts followed by proteasome activity assay with chymotrypsin-like 
activity substrate Leu-Leu-Val-Tyr-AMC and immunoblotting with PSMD1 
(19S subunit) antibody are shown. Extracts were resolved by SDS-PAGE and 
immunoblotting for analysis of PSMD11 levels and β-actin loading control. In 
a, b, d, f and h, data represent the mean ± s.e.m. All statistical comparisons 
were made by Student’s t-test for unpaired samples. 
in their most physiological form are inactive, unable to degrade 
denatured proteins or cleave peptides’®. The 19S regulatory subunit 
is responsible for stimulating the 20S proteasome to degrade proteins, 
because ATPases of the regulatory particle open the 20S core, allowing 
substrates access to proteolytic active sites'’. SDS treatment of cell 
extracts, a condition that activates 20S particles by allowing gate open- 
ing”®, resulted in equivalent activities among all cell types (Fig. 2a and 
Supplementary Fig. 8). This result suggests that all cell types have an 
equal number of 20S particles, but hESCs have increased levels of active 
26S/30S proteasomes. We examined the expression of the different 19S 
proteasome subunits and observed that PSMD11 was the only 19S 
subunit to decrease as hESCs differentiated (Fig. 2b, c and Supplemen- 
tary Figs 9-12). Consistent with the hESCs results, we observed 
increased PSMD11 levels in iPSCs (Fig. 2d, e). Accordingly, decreased 
expression of PSMD11 in hESCs (Supplementary Table 1) reduced 
proteasome activity (Fig. 2f), demonstrating that the increased levels 
of this subunit in hESCs are crucial for increased proteasome activity. 
PSMD11 has a vital role in stabilizing the otherwise weak interaction 
between the 20S core and the 19S cap*, suggesting that hESCs might 
have more assembled proteasomes. We detected more 30S particles in 
hESCs compared with NPCs and neurons. Furthermore, as more 20S 
subunits are assembled into 30S particles, less free 20S is found in 
hESCs (Fig. 2g). Ectopic expression of PSMD11 was sufficient to 
increase proteasome activity and assembly in cells with relatively low 
proteasome activity (Fig. 2h, i). Moreover, knockdown of PSMD11 
resulted in fewer assembled proteasomes (Fig. 2j). 

The levels of RPN-6, the C. elegans orthologue of PSMD11, are 
increased in the long-lived glp-1 mutant. In this mutant, increased 
proteasome activity, rpn-6 expression and longevity are modulated 
by the forkhead box O (FOXO) transcription factor DAF-16. To 
examine whether FOXO transcription factors regulate proteasome 
activity in hESCs, we reduced expression of the closest human 
Figure 3 | FOXO4 regulates proteasome activity in hESCs. 

a, Chymotrypsin-like proteasome activity in H9 hESCs transiently infected 
with lentiviruses (LV; relative slope to non-infected cells) is shown. n = 19; 
*P < 0.00001. b, Chymotrypsin-like proteasome activity of stable-infected 
hESCs (relative slope to GFP cells) is shown. GFP (n = 7), FOXO4 shRNA 1 
(n = 6), FOXO4 shRNA 2 (n = 3), FOXO4 shRNA 3 (n = 6). GFP versus 
FOXO4 shRNA 1, **P < 0.01; GFP versus FOXO4 shRNA 2, **P < 0.01; GFP 
versus FOXO4 shRNA 3, ***P = 4.5 X 10 ®. All shRNAs target the 3’ 
untranslated region (UTR) of the human genes. c, Proteasome activity of 
transiently infected hESCs (relative slope to non-infected H9 hESCs) is shown. 
n = 7; non-infected cells versus FOXO4(OE) cells, *P = 0.41; non-infected cells 
versus FOXO4-AAA(OE) cells, *P < 0.05. d, Proteasome activity (relative 
slope to GFP hESCs) is shown. n = 4; FOXO4 shRNA 3 versus FOXO4 shRNA 
daf-16 orthologues FOXO1, FOXO3 and FOXO4 (Supplementary 
Fig. 13 and Supplementary Table 2). Notably, we found that FOXO4 
was crucial to modulate proteasome activity in hESCs, whereas FOXO1 
and FOXO3, as well as HSF1, had little or no effect on proteasome 
activity (Fig. 3a, b and Supplementary Fig. 14). As hESCs differentiated 
into neural cells, trophoblasts or fibroblasts, there was a corresponding 
decrease in FOXO4 expression (Supplementary Figs 15-17). 
Accordingly, this decrease in FOXO4 expression is reprogrammed 
from somatic cells to iPSCs (Supplementary Fig. 15). FOXO1 had a 
similar expression pattern to FOXO4 in H9 but not in HUES-6 hESCs 
(Supplementary Figs 15 and 16). Furthermore, reduction of FOXO4 
affected proteasome activity in the multipotent NPCs, which retain 
partial stem-cell character, but did not affect proteasome activity in 
differentiated neurons (Supplementary Fig. 18 and Supplementary 
Table 3). These results raised the question as to whether FOXO4 regu- 
lation of proteasome activity could be a general mechanism found in 
dividing cells. However, we found that FOXO4 was not required for 
proteasome activity regulation in BJ fibroblasts or HEK293T but rather 
seems to be specific to hESCs (Supplementary Fig. 19). FOXO4 tran- 
scriptional activity is inhibited by phosphorylation on Thr 32, Ser 197 
and Ser 262 sites, and once dephosphorylated, it translocates to the 
nucleus and induces target gene expression*’””. Expression of a con- 
stitutively active FOXO4 triple alanine mutant (FOXO4-AAA), but not 
wild-type FOXO4, resulted in further upregulation of proteasome 
activity in hESCs (Fig. 3c, Supplementary Fig. 20 and Supplemen- 
tary Table 4a, b). In addition, ectopic expression of FOXO4-AAA 
in FOXO4 short hairpin RNA (shRNA) cells partially restored 
proteasome activity of these cells (Fig. 3d and Supplementary 
Table 4c). We found that loss of FOXO4 resulted in reduced expression 
of PSMD11 in hESCs and in the multipotent NPCs, but did not affect 
PSMD11 in differentiated neurons (Fig. 3e, f and Supplementary 
Fig. 21). Knockdown of FOXO1, FOXO3 or HSF1 did not affect 
3 + FOXO4-AAA, *P < 0.01. e, Top, knockdown of FOXO4 decreases 
expression of PSMD11 in transiently infected H9 hESCs (GFP versus FOXO4 
shRNA, ***P < 0.001; GFP versus FOXO4 shRNA 2, *P < 0.05; GFP versus 
FOXO4 shRNA 3, ***P < 0.001). GFP (n = 15), FOXO4 shRNA (n = 19), 
FOXO4 shRNA 2 (n = 4), FOXO4 shRNA 3 (n = 5). Bottom, stable 
overexpression of the FOXO4-AAA mutant increases PSMD11 expression in 
H9 hESCs (GFP versus FOXO4(OE), P = 0.69; GFP versus FOXO4-AAA(OE), 
**P < 0.01). GFP (n = 7), FOXO4(OE) (n = 8), FOXO4-AAA(OE) (n = 7). 

f, Western blot analysis of PSMD11 levels. β-actin is the loading control. 

g, Proteasome activity (relative slope to GFP H9 hESCs) is shown. n = 4; GFP 
versus FOXO4 shRNA, *P < 0.01; GFP versus FOXO4 shRNA+ PSMD11(OE), 
P = 0.50. Data in a-e and g represent the mean ± s.e.m. Statistical comparisons 
were made by Student’s t-test for unpaired samples. 
Figure 4 | Acute proteasome inhibition affects pluripotency of hESCs. 

a, Real-time PCR analysis of pluripotency (OCT4 (also known as POU5F1), 
NANOG, SOX2, UTF1, DPPA4, DPPA2, ZFP42 and TERT), trophectodermal 
(CDX2), ectodermal (PAX6, FGF5), mesodermal (MSX1) and endodermal 
(AFP, GATA6, GATA4 and ALB) germ-layer markers. The graphs (relative 
expression to H9 hESCs plus dimethylsulphoxide (DMSO)) represent the 


the expression of PSMD11 in any cell type tested, including hESCs 
(Supplementary Tables 5-7). Moreover, overexpression of FOXO4- 
AAA increased PSMD11 levels in hESCs (Fig. 3e, f and Supplemen- 
tary Table 5). Ectopic expression of PSMD11 in FOXO4 shRNA hESCs 
rescued their proteasome activity, indicating that expression of 
PSMD11 by FOXO4 is sufficient to regulate proteasomal function 
(Fig. 3f, g). 

To test the effects of proteasome inhibition on hESC identity, we 
induced an acute proteasome inhibition in hESCs by using the MG- 
132 proteasome inhibitor. Notably, hESCs were more sensitive to 
proteasome inhibition than were NPCs or neurons, and we had to 
decrease the MG-132 concentration almost 100 times to avoid cell 
death and detachment of hESCs (data not shown). Notably, low con- 
centrations of MG-132 (62.5 nM) were sufficient to reduce proteasome 
activity and induce accumulation of polyubiquitinated proteins in 
hESCs (Supplementary Fig. 22). Even in the absence of differentiation 
treatment, we already observed that proteasome inhibition resulted in 
decreased levels of pluripotency markers and modified the levels of 
markers of the distinct germ and extraembryonic layers, while decreas- 
ing the expression of proteins involved in neurogenesis, such as PAX6 
and MSX1 (Fig. 4). 

Collectively, our results establish increased proteasome activity as 
an intrinsic characteristic of hESC identity. Our findings raise the 
intriguing question of why these cells need enhanced proteasome 
activity. One possibility is that hESCs cannot tolerate toxic, misfolded 
proteins, and increased proteostasis could be required to avoid hESC 
senescence and maintain an intact proteome for either self-renewal or 
the generation of an intact cell lineage. Alternatively, high proteasome 
activity may be tightly linked to other cellular processes, such as trans- 
lation, to ensure future integrity of the proteome. In addition, our results 
indicate that an orthologue of DAF-16, a transcription factor that reg- 
ulates both lifespan and resistance to proteotoxic stress in invertebrates, 
crosses evolutionary boundaries and links hESC identity to invertebrate 
longevity modulation. It will be of particular interest to identify other 
genes of the proteostasis network regulated by FOXO4 in hESCs. In 
conclusion, our findings may trigger new advances in understanding 


mean ± s.e.m., n = 12. Proteasome inhibition (62.5 nM MG-132 for 24 h) in 
hESCs induces a decrease in pluripotency markers and modifies the levels of 
germ-layer markers. *P < 0.05; **P < 0.01; ***P < 0.001. b, Western blot 
analysis with antibodies against SOX2, PAX6, FGF5 and MSX1. β-actin is the 
loading control. Statistical comparisons were made by Student’s t-test for 
unpaired samples. 


hESC differentiation or cell reprogramming and open new possibilities 
for cell therapy by modulation of the proteostasis network. 


METHODS SUMMARY 


26S proteasome fluorogenic peptidase assays. The in vitro assay of 26S 
proteasome activities was performed as previously described’®. Cells were collected 
in proteasome activity assay buffer (50 mM Tris-HCl, pH 7.5, 250 mM sucrose, 
5 mM MgCl2, 0.5 mM EDTA, 2 mM ATP and 1 mM dithiothreitol) and lysed by 
passing ten times through a 27-gauge needle. Lysate was centrifuged at 10,000g for 
10 min at 4 °C. Fifteen to twenty-five micrograms of total protein of cell lysates were 
transferred to a 96-well microtiter plate (BD Falcon) and incubated with fluorogenic 
substrate. Fluorescence (380 nm excitation, 460 nm emission) was monitored on a 
microplate fluorometer (Infinite M1000, Tecan) every 5 min for 1h at 37°C. 
Native gel immunoblotting of the proteasome. HEK293T cells were run on 3.5% 
native gels prepared in resolving buffer (90 mM Tris base, 90 mM boric acid, 5 mM 
MgCl2, 0.5 mM EDTA and 1 mM ATP) with 5 mM ATP, 1 mM dithiothreitol and 
3.5% acrylamide from a 40% stock solution of acrylamide and bisacrylamide in a 
37.5:1 ratio (Bio-Rad). These were run at 110 V for 3 h at 4 °C. Activity assays were 
performed by incubating the gels in activity assay buffer for 20 min at 37 °C and 
developed using a BioRad Gel Doc with ultraviolet illumination. Before transfer, 
the gels were incubated in transfer buffer (25 mM Tris base and 192 mM glycine) 
with 1% SDS for 10 min followed by a 10-min incubation in transfer buffer. The 
protein was transferred to polyvinylidene difluoride (PVDF) membranes at 5 V for 
16h in transfer buffer using an Idea Scientific Genie Blotter. Western blot analysis 
was performed with anti-PSMD1 (Abcam) and analysed using the Odyssey system 
(LI-COR). Extracts were also analysed by SDS-PAGE to determine protein 
expression levels and loading control. 

A detailed description of all experimental methods, including hESC culture and 
differentiation, is provided in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 

hESCs culture and differentiation. The human H9 (WA09) hESC line was 
obtained from WiCell Research Institute. The HUES-6 hESC line was obtained 
from the laboratory of D. Melton. hESC lines were maintained on a mitotically 
inactive mouse embryonic fibroblast (MEF) feeder layer in hESC medium, 
DMEM/F12 (Invitrogen) supplemented with 20% knockout serum replacement 
(Invitrogen), 1 mM L-glutamine, 0.1 mM non-essential amino acids, 
β-mercaptoethanol and 10 ng ml⁻¹ bFGF (Joint Protein Central). When the 
co-culturing of hESCs with MEFs was not possible owing to interference with 
downstream assays, H9 hESCs were also maintained on Matrigel (BD Biosciences) 
using mTeSR1 (Stem Cell Technologies). When cultured on Matrigel, HUES-6 
cells were fed conditioned medium collected from cultured MEFs. hESC colonies 
were passaged using a solution of collagenase (1 mg ml⁻¹) or dispase (2 mg ml⁻¹), 
and scraping the colonies with a glass pipette. For our experimental assays, we used 
H9 hESCs passages 40-45 and HUES-6 hESCs passages 30-35. The human iPSC 
lines (control BJ-iPSC lines) were derived and characterized as previously 
reported” and cultured as described above for hESCs. 

Neural differentiation was performed as follows. hESCs grown on inactivated 
MEFs were fed N2/B27 medium (DMEM/F12-GlutaMAX (Invitrogen), N2 
(Invitrogen) B27 without retinoic acid (Invitrogen)) for 2 days before being treated 
with collagenase type IV (1 mg ml⁻¹ in DMEM/F12) at 37 °C for 1 h. Once 
colonies lifted off the plate, they were gently washed and transferred to ultra- 
low attachment plates (Corning). Aggregates (embryoid bodies) were allowed to 
form and grown in suspension for 1 week in N2/B27 medium with medium 
changes as needed, roughly every other day. Embryoid bodies were then trans- 
ferred onto polyornithine (PORN)/laminin-coated plates in N2/B27 medium with 
1 µg ml⁻¹ laminin (Invitrogen), where they were allowed to adhere and develop 
neural rosettes and projections. After 1 week, colonies were picked for NPC lines. 
Picked colonies containing rosettes or projections were dissociated with TrypLE 
(Invitrogen) for 5 min at 37 °C and plated onto PORN/laminin coated plates in 
NPC medium (DMEM/F12, N2/B27-RA (Invitrogen), 1 µg ml⁻¹ laminin and 
20 ng ml⁻¹ FGF2). The resulting monolayer culture was grown at a high density 
and split 1:3 every week. For our experimental assays, we used NPCs at passages 
10-14. 

For neuronal differentiation, NPCs were dissociated with TrypLE (Invitrogen) 
and plated into neuronal differentiation medium (DMEM/F12, N2/B27-RA 
(Invitrogen), 1 µg ml⁻¹ laminin, 20 ng ml⁻¹ BDNF (Peprotech), 20 ng ml⁻¹ 
GDNF (Peprotech), 1 mM dibutyryl-cyclic AMP (Sigma) and 200 nM ascorbic 
acid (Sigma)) onto PORN/laminin-coated plates. For this study, cells were differ- 
entiated in 6-well plates, with approximately 2 < 10° cells per well. Cells were 
differentiated for 2-3 months, with weekly feeding of neuronal differentiation 
medium. 

Differentiation to fibroblast cells involved the formation of embryoid bodies as 
described above, but cultured in embryoid body medium (IMDM base medium 
supplemented with 15% FBS (Atlanta Biologicals), 0.1 mM non-essential amino 
acids and 1% glutamax (Invitrogen)) and maintained on ultra-low attachment 
plates with daily medium changes. One week later the floating embryoid bodies 
were plated on gelatin-coated plates and passaged at confluence three times before 
use. Alternatively, a non-embryoid body method was used involving the individu- 
alization of hESCs using Accutase (Millipore) and plating the cells at a density of 
2.5 X 10* cells per cm? in embryoid body medium containing ROCK inhibitor 
(Stemgent) at 10 µM. The cells were fed daily with straight embryoid body 
medium. At confluence, any areas still showing a stem-cell-like morphology were 
removed by aspiration and then passaged using Accutase. After three passages, the 
cells present had fibroblast morphology and were confirmed by PCR of 
lineage-specific markers. 

Trophoblast differentiation was performed as described previously using high 

levels of BMP4 (ref. 23). Keratinocyte differentiation was performed following the 
protocol established in ref. 24. BJ human fibroblasts (ATCC) were cultured in 
DMEM (Invitrogen) supplemented with 10% FBS and 0.1mM non-essential 
amino acids, and passaged with trypsin. Hippocampal and cerebellar astrocytes 
are from Sciencell. 
Generation of lentiviral vectors. The shRNA-expressing lentiviral vectors were 
generated by cloning the sequences described in Supplementary Table 8 into the 
pSIH1-copGFP vector (SBI Biosystems) to generate pLV-siHSF-1, pLV-siFOXO1, 
pLV-siFOXO3, pLV-siFOXO4, pLV-si3’UTR_1 FOXO4, pLV-si3’UTR_2 
FOXO4 and pLV-si3’UTR_3 FOXO4. A control shRNA vector was generated 
by cloning the sequence 5'-CGTGCGTTGTTAGTACTAATCCTATTT-3' 
designed against the sequence of luciferase (SBI Biosystems) into the same vector 
to generate pLV-siLuc. The GFP-expressing vector was prepared from the third 
generation self-inactivating lentivirus”. Lentiviruses were packaged by transient 
transfection in 293T cells”. 
Lentivirus (LV)-non targeting shRNA control, LV-shPSMD11_1 (clone 
number TRCN0000003948), LV-shPSMD11_2 (TRCN0000003950), LV- 
shPSMC2_1(TRCN0000007181), LV-shPSMC2_2 (TRCN0000007183) in 
pLKO.1-puro-CMV-tGFP vector were obtained from Mission shRNA (Sigma). 
FOXO4-overexpressing lentiviral constructs (FOXO4(OE)) were generated as follows. 
Flag–FOXO4 construct was obtained from Addgene (plasmid 17549). PCR 
was performed to generate a product to be cloned into pLVX puro lentiviral 
plasmid (Clontech) using the XhoI/SmaI sites. Forward primer (with 5’ XhoI site 
for cloning): 5’-CGCGTACTCGAGATGGATCCGGGGAATGAGAATTCAGC 
CACAGAGGCTGCCGCGATCATAGAC-3’; reverse primer (with 3’ SmaI site 
for cloning): 5’-CCGGAACCCGGGTCAGGGATCTGGCTCAAAG-3’. To 
generate FOXO4-AAA (Thr 32, Ser 197, Ser 262), site-directed mutagenesis of 
wild-type FOXO4 was performed by using Pfu Turbo. The primers used for 
site-directed mutagenesis were: Thr32Ala, forward, 5’-GTCCCCGCTCCT 
GTGCTTGGCCCCTTCC-3’; reverse, 5'-GGAAGGGGCCAAGCACAGGAG 
CGGGGAC-3'. Ser197Ala, forward, 5’-GCAAAGCCCCCCGCCGCAGAGCC 
GCAGCCATGGATAGCAGCAG-3’; reverse, 5'-CTGCTGCTATCCATGGCT 
GCGGCTCTGCGGCGGGGGGCTTTGC-3’. Ser262Ala, forward, 5'-GTCCAC 
GAAGCAGCGCAAATGCCAGCAGTGTCAGC-3’; reverse, 5'-GCTGACACT 
GCTGGCATTTGCGCTGCTTCGTGGAC-3’. 
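
A quick sanity check on a mutagenesis primer pair is to confirm that the reverse primer is the exact reverse complement of the forward primer. The sketch below is not part of the original Methods; it assumes plain A/C/G/T sequences, and the helper name `reverse_complement` is introduced here purely for illustration. It is applied to the Thr32Ala pair as printed above.

```python
# Illustrative check, not the authors' procedure: verify that a
# site-directed mutagenesis primer pair is mutually reverse-complementary.

# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    cleaned = seq.upper().replace(" ", "")  # tolerate line-wrap spaces
    return cleaned.translate(COMPLEMENT)[::-1]


# Thr32Ala primers as printed in the text (5' to 3'), with the
# line-wrap space removed.
thr32ala_forward = "GTCCCCGCTCCTGTGCTTGGCCCCTTCC"
thr32ala_reverse = "GGAAGGGGCCAAGCACAGGAGCGGGGAC"

if __name__ == "__main__":
    is_pair = reverse_complement(thr32ala_forward) == thr32ala_reverse
    print("Thr32Ala primers are reverse complements:", is_pair)
```

The same function can be pointed at the Ser197Ala and Ser262Ala pairs to catch transcription errors before ordering oligonucleotides.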

PCR was performed with one set of primers at a time. DpnI was added to the 
PCR product for 2 h at 37 °C before transformation of DH5α bacteria. Plasmid 
preps were sequenced before the next mutation was introduced. 

PSMD11-overexpressing lentiviral construct (PSMD11(OE)) was generated as 

follows. Human PSMD11 complementary DNA was PCR-amplified and cloned 
into pLVX-Puro using XhoI and BamHI. Resulting constructs were transformed 
into One Shot Stbl3 Escherichia coli (Invitrogen). Constructs were sequence 
verified and thereafter transfected into packaging cells to produce high titre 
lentiviruses. 
Lentiviral infection of human stem cells. hESC colonies growing on Matrigel 
were incubated with mTeSR1 medium containing 10 µM ROCK inhibitor for 1 h 
and individualized using Accutase. Fifty-thousand cells were infected in 
suspension with 10 µl of concentrated lentivirus in the presence of 10 µM 
ROCK inhibitor. Cell suspension was centrifuged to remove virus, passed through 
a mesh of 40 µm to obtain individual cells, and plated back on a feeder layer of 
fresh MEFs in hESC media supplemented with 10 µM ROCK inhibitor. After a few 
days in culture, small hESC colonies arose. For the GFP and FOXO shRNA 
lentiviral stable lines, GFP-positive colonies were selected and manually passaged 
onto fresh MEFs to establish new hESC lines. For non-targeting shRNA lentivirus, 
PSMD11 shRNA, PSMC2 shRNA, FOXO4(OE) shRNA, FOXO4-AAA(OE) 
shRNA and PSMD11(OE) shRNA stable lines, we performed 1 µg ml⁻¹ puromycin- 
resistance selection during three days and then colonies were manually passaged 
onto fresh MEFs to establish new hESC lines. 

Transient infection experiments were performed as follows. hESC colonies 
growing on Matrigel were incubated with mTeSR1 medium containing 10 µM 
ROCK inhibitor for 1 h and individualized using Accutase. Ten thousand cells 
were plated on Matrigel plates and incubated with mTeSR1 medium containing 
10 µM ROCK inhibitor for 1 day. Cells were infected with 2 µl of concentrated 
lentivirus. Plates were centrifuged at 800g for 1 h at 30 °C. Cells were fed with fresh 
media the day after to remove virus. NPCs were split as described above, and 
infected with 2 µl of concentrated lentivirus for 1 day. Neurons were infected after 
2 months of differentiation with 2 µl of concentrated lentivirus for 1 day. In all the 
cases, cells were collected for experimental assays after four days of infection. 
26S proteasome fluorogenic peptidase assays. The in vitro assay of 26S protea- 
some activities was performed as previously described’®. Cells were collected in 
proteasome activity assay buffer (50 mM Tris-HCl, pH 7.5, 250 mM sucrose, 5 mM 
MgCl2, 0.5 mM EDTA, 2 mM ATP and 1 mM dithiothreitol) and lysed by passing 
ten times through a 27-gauge needle attached to a 1-ml syringe. Lysate was 
centrifuged at 10,000g for 10 min at 4 °C. Approximately 15-25 µg of total protein of 
cell lysates were transferred to a 96-well microtiter plate (BD Falcon), and the 
fluorogenic substrate was then added to lysates. To measure the chymotrypsin-like 
activity of the proteasome we used either Z-Gly-Gly-Leu-AMC (Enzo) or Suc-Leu- 
Leu-Val-Tyr-AMC (Enzo). We used Z-Leu-Leu-Glu-AMC (Enzo) to measure the 
caspase-like activity of the proteasome, and Ac-Arg-Leu-Arg-AMC for the 
proteasome trypsin-like activity. Fluorescence (380 nm excitation, 460 nm emis- 
sion) was monitored on a microplate fluorometer (Infinite M1000, Tecan) every 
5 min for 1h at 37 °C. Protein concentration of the cell homogenates was deter- 
mined using the BCA protein assay (Pierce). For SDS experiments, 0.025% SDS 
was added to cell lysates 5 min before the digestion assay. 
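
The figure legends report activity as a "relative slope", which can be read as the rate of fluorescence increase in a sample divided by the rate in a reference sample. The sketch below illustrates that reading under stated assumptions: the slope is taken as an ordinary least-squares fit over the 13 reads (every 5 min for 1 h), the fluorescence values are hypothetical, and the function names are introduced here for illustration. The text does not state how the authors fitted the slope or whether they used only a linear portion of the curve.

```python
# Illustrative sketch only: "relative slope" of a kinetic fluorescence
# read-out, assuming a least-squares line through all time points.

def slope(times, values):
    """Ordinary least-squares slope of values against times."""
    n = len(times)
    if n != len(values) or n < 2:
        raise ValueError("need two or more paired time/value points")
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    return sxy / sxx


def relative_activity(times, sample, reference):
    """Slope of the sample trace divided by the slope of the reference trace."""
    return slope(times, sample) / slope(times, reference)


# Reads every 5 min for 1 h, as in the assay description.
times_min = list(range(0, 65, 5))

# Hypothetical fluorescence traces (arbitrary units), not data from the paper.
reference_rfu = [100 + 40 * t for t in times_min]  # e.g. reference extract
sample_rfu = [100 + 10 * t for t in times_min]     # e.g. differentiated cells

if __name__ == "__main__":
    print(relative_activity(times_min, sample_rfu, reference_rfu))
```

In practice each slope would presumably be computed per replicate well, so that a mean and s.e.m. can be taken across replicates before comparison.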

Native gel immunoblotting of the proteasome. hESCs (H9), NPCs and neurons 
were collected in proteasome activity assay buffer (50 mM Tris-HCl, pH 7.6, 5 mM 
MgCl2, 0.5 mM EDTA, 5 mM ATP, 1 mM dithiothreitol and 10% glycerol supplemented 
with Roche phosphatase inhibitors) and lysed by passing ten times 
through a 27-gauge needle attached to a 1-ml syringe. Lysate was centrifuged at 
16,000g for 15 min at 4 °C. Fifteen micrograms of total protein was run on a 3-12% 
NativePAGE Bis-Tris gel (Invitrogen) in NativePAGE running buffer (Invitrogen) 
at 4°C for 1h at 150 V and then increased to 200 V for a further hour. Proteins 
were then transferred to a polyvinylidene difluoride (PVDF) membrane at 25 V for 
1 h in NativePAGE transfer buffer (Invitrogen) in an XCell II Blot module 
(Invitrogen). After transfer, the PVDF membrane was incubated for 20 min with 
8% acetic acid to fix the proteins and then dried. Western blot analysis was 
performed with anti-20S alpha 1-7 (Abcam) and anti-PSMD2 (Abcam). 
HEK293T cells were run on 3.5% native gels prepared in resolving buffer (90 mM 
Tris base, 90 mM boric acid, 5 mM MgCl2, 0.5 mM EDTA and 1 mM ATP) with 
5mM ATP, 1 mM dithiothreitol and 3.5% acrylamide from a 40% stock solution of 
acrylamide and bisacrylamide in a 37.5:1 ratio (Bio-Rad). These were run at 110 V 
for 3h at 4°C. Activity assays were performed by incubating the gels in activity 
assay buffer for 20 min at 37°C and developed using a BioRad Gel Doc with 
ultraviolet illumination. Before transfer, the gels were incubated in transfer buffer 
(25mM Tris base and 192mM glycine) with 1% SDS for 10 min followed by a 
10-min incubation in transfer buffer. The protein was transferred to PVDF 
membranes at 5V for 16h in transfer buffer using an Idea Scientific Genie 
Blotter. Western blot analysis was performed with anti-PSMD1 (Abcam) and 
analysed using the Odyssey system (LI-COR Biosciences). Extracts were also ana- 
lysed by SDS-PAGE to determine protein expression levels and loading control. 
Western blot. For analysis of proteasome subunits, cells were collected in protea- 
some activity assay buffer supplemented with protease inhibitors (Roche) and 
lysed by passing ten times through a 27-gauge needle attached to a 1-ml syringe. 
Lysate was centrifuged at 10,000g for 10 min at 4 °C. Protein concentration of the 
cell homogenates was determined using the BCA protein assay (Pierce). For ana- 
lysis of transcription factor and polyubiquitinated proteins, cells were collected 
from tissue culture plates by cell scraping and lysed in protein cell lysis buffer 
(10 mM Tris-HCl, pH 7.4, 10 mM EDTA, 50 mM NaCl, 50 mM NaF, 1% Triton 
X-100, 0.1% SDS supplemented with 2 mM sodium orthovanadate, 1 mM 
phenylmethylsulphonyl fluoride and Complete mini protease and PhosSTOP 
inhibitor cocktail mix) for 2h at 1,000r.p.m. and 4°C in a Thermomixer. 
Protein concentrations were determined with a standard Bradford protein assay 
(BioRad). Approximately 20-50 µg of total protein was separated by SDS- 
PAGE, transferred to nitrocellulose membranes (Whatman) and subjected to 
immunoblotting. Western blot analysis was performed with anti-FK1 (Enzo), 
anti-20S alpha 1-7 (Abcam), anti-proteasome 20S C2 (Abcam), anti-Rpt6 
(Biomol), anti-PSMD1 (Abcam), anti-PSMD2 (Abcam), anti-PSMD14 
(Abcam), anti-PSMB6 (Abcam), anti-PSMD11 (Novus), anti-FOXO4 (55D4) 


(Cell Signaling), anti-FOXO1 (C29H4) (Cell Signaling), anti-SOX2 (D6D9) 
(Cell Signaling), anti-FGF5 (Abcam), anti-MSX1 (Abcam), anti-PAX6 (Abcam) 
and anti-β-actin (Abcam). The affinity of the antibody to PSMD11 has been 
characterized by detecting a decrease at the protein levels with PSMD11 shRNA 
or an increase by ectopic expression of PSMD11. These experiments convincingly 
show differences in only one band and we ascribe any alteration of PSMD11 to this 
band. 

Coomassie staining. Protein lysates were separated by SDS-PAGE and visualized 
directly in the gel by Coomassie staining’*. Gels were incubated in fixing solution 
(50% methanol, 10% acetic acid and 100mM ammonium acetate) for 60 min, 
stained with 0.025% Coomassie dye in 10% acetic acid on a shaker overnight 
and destained twice in 10% acetic acid for 60 min. Gels were transferred to water 
and analysed with the Odyssey imager (Li-Cor Bioscience). 
Bromodeoxyuridine proliferation assay. Cells were incubated with media con- 
taining 10mM bromodeoxyuridine (BrdU) for 40 min. Cells were fixed with 
formaldehyde 4% in PBS for 15 min and washed in PBS. Before permeabilization, 
cells were incubated for 1 h in 2 N HCl at room temperature followed by extensive 
washes in PBS. Cells were permeabilized with 0.5% Triton X-100 in PBS for 10 min 
and blocked with 5% normal donkey serum in 1% PBS-BSA for 40 min at room 
temperature. Rabbit anti-BrdU antibody (ABD Serotech) was diluted in 1% PBS- 
BSA and used for overnight incubation followed by incubation with a biotinylated 
anti-rabbit secondary antibody (Vector) for a further 2h at room temperature. 
Finally, cells were incubated with streptavidin-AlexaFluor 568 (Jackson Immuno 
Research) for 1 h. DAPI was used to visualize nuclei at a concentration of 0.5 µg 
ml⁻¹ in PBS. 

RNA isolation and quantitative RT-PCR. Total RNA was extracted using 
RNAbee (Tel-Test Inc.). cDNA was created using the Quantitect reverse tran- 
scriptase kit (Qiagen). SybrGreen real-time qPCR experiments were performed as 
described in the manual using an ABI Prism 7900HT (Applied Biosystems) and
cDNA at a 1:20 dilution. Data were analysed with the comparative $2^{-\Delta\Delta C_t}$ method
using β-actin and GAPDH as housekeeping genes. See Supplementary Table 9 for
details about the primers used for this assay. 
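
The comparative $2^{-\Delta\Delta C_t}$ calculation named above can be sketched as follows. This is a minimal illustration, not the analysis code used for the paper: the function name, the example Ct values and the choice to average the Ct values of the two housekeeping genes are all assumptions made for the example.

```python
from statistics import mean


def relative_expression(ct_target_sample, ct_refs_sample,
                        ct_target_control, ct_refs_control):
    """Fold change of a target gene by the comparative 2^(-ddCt) method.

    ct_refs_* are lists of housekeeping-gene Ct values (for example
    beta-actin and GAPDH); averaging them is an assumption of this sketch.
    """
    d_ct_sample = ct_target_sample - mean(ct_refs_sample)
    d_ct_control = ct_target_control - mean(ct_refs_control)
    dd_ct = d_ct_sample - d_ct_control
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values: the target amplifies two cycles earlier in the sample
fold = relative_expression(24.0, [18.0, 20.0], 26.0, [18.0, 20.0])
print(fold)
```

A target that crosses threshold two cycles earlier than in the control, with unchanged housekeeping genes, corresponds to a fourfold increase in relative expression.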

The quantum internet¹ is predicted to be the next-generation
information processing platform, promising secure communication²,³
and an exponential speed-up in distributed computation²,⁴.
The distribution of single qubits over large distances via quantum
teleportation⁵ is a key ingredient for realizing such a global platform.
By using quantum teleportation, unknown quantum states
can be transferred over arbitrary distances to a party whose location
is unknown. Since the first experimental demonstrations of
quantum teleportation of independent external qubits⁶, an
internal qubit⁷ and squeezed states⁸, researchers have progressively
extended the communication distance. Usually this occurs without
active feed-forward of the classical Bell-state measurement result, 
which is an essential ingredient in future applications such as com- 
munication between quantum computers. The benchmark for a 
global quantum internet is quantum teleportation of independent 
qubits over a free-space link whose attenuation corresponds to the 
path between a satellite and a ground station. Here we report such 
an experiment, using active feed-forward in real time. The experi- 
ment uses two free-space optical links, quantum and classical, over 
143 kilometres between the two Canary Islands of La Palma and 
Tenerife. To achieve this, we combine advanced techniques invol- 
ving a frequency-uncorrelated polarization-entangled photon pair 
source, ultra-low-noise single-photon detectors and entanglement- 
assisted clock synchronization. The average teleported state fidelity 
is well beyond the classical limit⁹ of two-thirds. Furthermore,
we confirm the quality of the quantum teleportation procedure 
without feed-forward by complete quantum process tomography. 
Our experiment verifies the maturity and applicability of such 
technologies in real-world scenarios, in particular for future 
satellite-based quantum teleportation. 

Significant progress has been made recently in the field of quantum 
communication based on optical free-space links’?””’, which poten- 
tially allow much larger propagation distances compared to the exist- 
ing fibre networks because of the lower photon loss per kilometre. To 
enable quantum communication on a global scale and among parties 
not having access to any fibre network, it is foreseeable that experi- 
ments will involve ground-to-satellite’*’** and inter-satellite links. 

Previous experiments focused on the distribution of quantum states 
of single photons or entangled photon pairs over optical free-space 
links. However, the realization of more sophisticated multiphoton 
quantum information protocols, such as quantum teleportation, 
remained an experimental challenge under real-world conditions. 
Quantum teleportation is based on the simultaneous creation of at 
least three photons, which for random photon pair sources reduces 
the count rate by several orders of magnitude compared to experi- 
ments using only two photons'*’. This decreases the signal-to-noise 
ratio and requires a long integration time, such that high system
stability is necessary. Moreover, the complexity and environmental
requirements of a quantum teleportation set-up are increased signifi- 
cantly compared to previous two-photon experiments, which provides 
significant experimental and technological challenges. The work pre- 
sented in ref. 18 was a significant achievement in long-distance 
quantum communication, harnessing the entanglement between dif- 
ferent degrees of freedom of a single photon. However, because the 
teleported qubit was not provided independently from the outside, the 
applicability of that scheme is limited. Most earlier experiments on 
teleportation of qubits and squeezed states** were in-laboratory 
demonstrations, and hence the communication distances were rather 
short. Although fibre-based teleportation has been demonstrated 
experimentally’, the maximum transmission distance is limited by 
the intrinsic photon loss in optical fibre, unless quantum repeaters are 
involved’*. In comparison to these previous studies, the experiment 
presented in this Letter achieves long-distance free-space teleportation 
of an independent quantum state, thus paving the way for satellite- 
based global quantum communication. 

Quantum teleportation relies on using both a quantum channel and 
a classical channel between two parties, usually called Alice and Bob° 
(here located in La Palma and Tenerife respectively; Fig. 1a). The
quantum channel is used by Alice and Bob to share the entangled 
auxiliary state” 

$$|\Psi^-\rangle_{23} = \frac{1}{\sqrt{2}}\left(|H\rangle_2|V\rangle_3 - |V\rangle_2|H\rangle_3\right) \qquad (1)$$

which is one of the four maximally entangled Bell states
($|\Psi^\pm\rangle = \frac{1}{\sqrt{2}}(|H\rangle|V\rangle \pm |V\rangle|H\rangle)$ and
$|\Phi^\pm\rangle = \frac{1}{\sqrt{2}}(|H\rangle|H\rangle \pm |V\rangle|V\rangle)$).
$|H\rangle$ and $|V\rangle$ denote the horizontal and vertical polarization states. Alice
and Bob share this entangled state, where photon 2 is with Alice and
photon 3 is with Bob. Charlie provides the input photon 1 to be teleported
to Alice in a general polarization state

$$|\phi\rangle_1 = \alpha|H\rangle_1 + \beta|V\rangle_1 \qquad (2)$$

where $\alpha$ and $\beta$ are complex numbers ($|\alpha|^2 + |\beta|^2 = 1$), unknown to
both Alice and Bob.

Alice then performs a Bell-state measurement (BSM), projecting
photons 1 and 2 randomly onto one of the four Bell states each with
the same probability of 25%. As a consequence, photon 3 is projected
onto the input state $|\phi\rangle$, up to a unitary transformation (U), which
depends on the outcome of the BSM. When Alice feeds the outcome of
the BSM forward to Bob via the classical channel, he can implement
the corresponding unitary operation in real time and thus obtain
photon 3 in the initial state (equation (2)) of photon 1. If $|\Psi^-\rangle_{12}$ is
detected, then U corresponds to the identity operation, which means
that Bob needs to do nothing. If, on the other hand, $|\Psi^+\rangle_{12}$ is detected,
Bob has to apply a π phase shift between the horizontal and the vertical
component of his photon 3.
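
The two cases just described can be checked numerically. The following is a minimal state-vector sketch, assuming ideal states and no loss; the function names and the example amplitudes are hypothetical and chosen only for illustration. It projects photons 1 and 2 of the joint state onto $|\Psi^-\rangle_{12}$ or $|\Psi^+\rangle_{12}$ and compares photon 3 with the input state.

```python
import numpy as np

# Polarization basis: |H> = (1, 0), |V> = (0, 1)
H = np.array([1, 0], dtype=complex)
V = np.array([0, 1], dtype=complex)

# The two Bell states distinguished in the text, as 4-component vectors
PSI_MINUS = (np.kron(H, V) - np.kron(V, H)) / np.sqrt(2)
PSI_PLUS = (np.kron(H, V) + np.kron(V, H)) / np.sqrt(2)

# pi phase shift between the H and V components
PI_PHASE = np.diag([1, -1]).astype(complex)


def project_bsm(phi, bell):
    """Project photons 1 and 2 of |phi>_1 |Psi^->_23 onto `bell`.

    Returns the outcome probability and the normalized state of photon 3.
    """
    # Rows index photons 1 and 2, columns index photon 3
    joint = np.kron(phi, PSI_MINUS).reshape(4, 2)
    unnormalized = bell.conj() @ joint
    prob = float(np.vdot(unnormalized, unnormalized).real)
    return prob, unnormalized / np.sqrt(prob)


def fidelity(target, state):
    """Overlap |<target|state>|^2 of two pure states."""
    return float(abs(np.vdot(target, state)) ** 2)


# Hypothetical input state alpha|H> + beta|V> with |alpha|^2 + |beta|^2 = 1
alpha, beta = 0.6, 0.8j
phi = alpha * H + beta * V

p_minus, out_minus = project_bsm(phi, PSI_MINUS)
p_plus, out_plus = project_bsm(phi, PSI_PLUS)

f_minus = fidelity(phi, out_minus)                # no operation applied
f_plus_raw = fidelity(phi, out_plus)              # before the phase shift
f_plus_fixed = fidelity(phi, PI_PHASE @ out_plus) # after the pi phase shift
print(p_minus, f_minus, p_plus, f_plus_raw, f_plus_fixed)
```

In this idealized model each of the two outcomes occurs with probability 1/4; the $|\Psi^-\rangle_{12}$ outcome leaves photon 3 in the input state up to a global phase, and the $|\Psi^+\rangle_{12}$ outcome does so only after the π phase shift.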
Figure 1 | Quantum teleportation between the Canary Islands La Palma and
Tenerife over both quantum and classical 143-km free-space channels.
a, Experimental scheme. Alice and Charlie are situated in La Palma, and Bob in
Tenerife. Charlie prepares the teleportation input photon 1 in $|\phi\rangle_1$, using a
heralded single-photon (HSP) source with a trigger photon 0 (photons are
indicated by black numerals on red circles). An Einstein-Podolsky-Rosen
(EPR)²³ source generates an entangled pair of photons 2 and 3 in the state
$|\Psi^-\rangle_{23}$. Alice then performs a Bell-state measurement (BSM) on photons 1 and
2, and projects them onto two of the four Bell states ($|\Psi^-\rangle_{12}$/$|\Psi^+\rangle_{12}$), and sends
the result via the classical feed-forward channel to Bob. Photon 3 is sent via the
free-space quantum channel to Bob, who applies a unitary transformation
(identity operation or π phase shift) on photon 3 depending on the BSM result
and thus turns its state $|\phi\rangle_3$ into a replica of the initial quantum state $|\phi\rangle_1$.
b, Set-up. In La Palma, a frequency-uncorrelated polarization-entangled
photon pair source generated photons 2 and 3 in BBO₁ (EPR/Alice) and a
collinear photon pair source generated photons 0 and 1 in BBO₂ (HSP/Charlie).
All single photons were coupled into single-mode fibres. For implementing the
BSM, photons 1 and 2 interfered in a fibre beam splitter (FBS) followed by
polarization-resolving single-photon detection (BSM/Alice). Photon 3 was
guided to the transmitter telescope via a 100-m single-mode fibre and sent to
Bob in Tenerife, where the unitary transformation was implemented using an
electro-optical modulator (EOM) and photon 3’s polarization was measured. A
real-time feed-forward operation was implemented by encoding the $|\Psi^+\rangle_{12}$
BSM result in 1,064-nm laser pulses, which were then sent to Bob via the
feed-forward channel. On Bob’s side, they were separated by a dichroic mirror (DM),
detected with a photodetector (PD) and used to trigger the EOM to perform the
required π phase shift operation. See main text for details.

Our experiment was conducted between Alice’s transmitter station
(at the Jacobus Kapteyn Telescope (JKT) of the Isaac Newton Group
on La Palma) and Bob’s receiver station (at the Optical Ground Station
(OGS) of the European Space Agency on Tenerife), separated by
143 km, both at altitudes of about 2,400 m. Our experimental set-up
is shown in Fig. 1b. At La Palma, near-infrared femtosecond pulses
(with a central wavelength of 808 nm) emitted from a mode-locked
Ti:sapphire laser, with a repetition rate of 80 MHz, were up-converted
to blue pulses (with a central wavelength of 404 nm). They were used
to generate two photon pairs via type-II spontaneous parametric
down conversion (SPDC) in two nonlinear β-barium borate (BBO)
crystals placed in sequence. The first SPDC source was aligned to
emit the entangled auxiliary photon pairs (photons 2 and 3) in the
$|\Psi^-\rangle_{23}$ state²⁴, equation (1), while the second crystal was a heralded
single-photon (HSP) source providing Charlie’s photon 1 to be teleported.
That source delivered pairs of horizontally (photon 0) and
vertically (photon 1) polarized photons in a product state. The detection
of photon 0 by the avalanche photodiode (APD) t served as a
trigger to herald the presence of photon 1. All photons were spectrally
filtered using interference filters (IF) and coupled into single-mode
fibres for spectral and spatial mode selection.

To realize the BSM, Alice’s photon 2 was overlapped on a fibre beam
splitter (FBS) with the teleportation input photon 1, whose polarization
was arbitrarily prepared by Charlie using half- and quarter-wave
plates (HWP and QWP). In each output port of the FBS, a polarizing
beam splitter (PBS) was used to project these photons on either horizontal
or vertical polarization. Fibre polarization controllers (FPCs)
were used to compensate for unwanted polarization rotation induced
by the fibres. Our BSM can identify two out of the four Bell states,
which is the optimum achievable with linear optics only²⁵. For more
details on our BSM, see Supplementary Information. While the BSM 
was being performed at Alice, photon 3 was guided to a 7-cm-aperture 
transmitter telescope through a 100-m-long single-mode fibre and 
then sent via a 143-km free-space quantum channel over to Bob in 
Tenerife. There, it was collected by the 1-m-aperture OGS telescope, 
and guided through its Coudé path to Bob.

In the first stage of our experiment, we only considered the cases 
where Alice detected $|\Psi^-\rangle_{12}$ in the BSM, which results in photon 3
being already in the state of the input photon, $|\phi\rangle_1$, and hence Bob was
required to perform an identity operation, that is, to do nothing at all. 
We verified the success of the teleportation process by analysing the 
polarization state of photon 3, which was accomplished by a polariza- 
tion analyser, consisting of a quarter- and a half-wave plate and two 
free-space coupled Si-APDs (silicon avalanche photodiodes) placed in 
each output mode of a polarizing beam splitter. 

In the second stage of our experiment, we implemented a real-time 
feed-forward operation. When Alice obtained $|\Psi^+\rangle_{12}$, she sent this
classical information to Bob. On receiving this information, Bob had
to apply a π phase shift between the $|H\rangle$ and $|V\rangle$ components of photon
3 to obtain the replica of the input state $|\phi\rangle_1$. For further details on the
feed-forward implementation, see Supplementary Information. 

The relevant events on Alice’s and Bob’s sides were recorded with 
separate time-tagging units each disciplined to the global positioning 
system (GPS)". First, Alice in La Palma identified the threefold coincidence
events corresponding to the $|\Psi^\pm\rangle_{12}$ outcomes of the BSM. This
was done using a coincidence logic circuit featuring two separate out- 
put signals realized with transistor-transistor logic (TTL) pulses. These 
TTL pulses were fed into a time-tagging unit which recorded the exact 
time and the BSM result into a binary file. Similarly, Bob fed either the 
signals of both detectors (stage 1, without feed-forward) or the coin- 
cidence between these signals and the BSM results sent via 1,064-nm 
laser pulses (stage 2, with feed-forward) into his time-tagging unit. 
After a measurement run was completed, both time-tagged data files 
were compared by cross-correlation, and the detection events asso- 
ciated with simultaneous detection of four photons originating from 
the same pump pulse were identified. 
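
The comparison of two time-tag files can be illustrated with a small sketch. It is not the coincidence-analysis code used in the experiment: the function names, the nanosecond units, the 1-ns window and the example time tags are assumptions. The idea shown is to try candidate time offsets between the two records, count events that line up within a window, and keep the offset that gives the most coincidences.

```python
import numpy as np


def find_coincidences(tags_a, tags_b, offset, window):
    """Pair up events from two sorted time-tag arrays.

    An event j in `tags_b` is paired with the nearest event i in `tags_a`
    when |tags_b[j] - offset - tags_a[i]| <= window. Returns (i, j) pairs.
    """
    tags_a = np.asarray(tags_a, dtype=float)
    shifted = np.asarray(tags_b, dtype=float) - offset
    # Insertion points give the neighbours of each shifted event in tags_a
    idx = np.searchsorted(tags_a, shifted)
    pairs = []
    for j, (t, k) in enumerate(zip(shifted, idx)):
        candidates = [i for i in (k - 1, k) if 0 <= i < len(tags_a)]
        if not candidates:
            continue
        i = min(candidates, key=lambda c: abs(tags_a[c] - t))
        if abs(tags_a[i] - t) <= window:
            pairs.append((int(i), j))
    return pairs


def estimate_offset(tags_a, tags_b, candidates, window):
    """Return the candidate offset that yields the most coincidences."""
    counts = [len(find_coincidences(tags_a, tags_b, c, window))
              for c in candidates]
    return candidates[int(np.argmax(counts))]


# Hypothetical time tags in nanoseconds; the ~477,000 ns delay is roughly
# the light travel time over 143 km and is used here only for illustration
alice = np.array([100.0, 250.0, 400.0, 900.0])
bob = np.array([477_100.5, 477_400.2, 600_000.0])

best = estimate_offset(alice, bob, [476_000.0, 477_000.0, 478_000.0], 1.0)
pairs = find_coincidences(alice, bob, best, 1.0)
print(best, pairs)
```

A real analysis would scan offsets far more finely and would also have to handle clock drift between the two stations; this sketch only shows the matching step.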
Figure 2 | State tomography results of the four quantum states without
feed-forward over the 143-km free-space channel with the BSM outcome of
$|\Psi^-\rangle_{12}$. The bar graphs show the reconstructed density matrices ρ for the four
states teleported from Alice (La Palma) to Bob (Tenerife) over the 143-km
free-space channel. The wire grid indicates the expected values for the ideal cases.
The data shown comprise a total of 605 fourfold coincidence counts in about
6.5 h. The uncertainties in state fidelities extracted from these density matrices
are calculated using a Monte Carlo routine assuming Poissonian errors.

The real-life long-distance environment provided a number of
challenges for the present teleportation experiment. These challenges
resulted most significantly in the need to cope with an extremely low
signal-to-noise ratio when using standard techniques, indeed too low
to perform a successful experiment. To enhance the signal-to-noise
ratio to a level that made the experiment possible, we used the following
advanced techniques: a frequency-uncorrelated polarization-entangled
photon pair source²⁶⁻²⁸, ultra-low-noise large-active-area
single-photon detectors at Bob²⁹, and entanglement-assisted clock
synchronization’*"*'’”. Whereas all these techniques have been implemented
individually, our work is (to our knowledge) the first that
combines all of them simultaneously and moreover in an outdoor
environment. See Supplementary Information for details.

First we present our results without feed-forward, where we
only considered the BSM outcome $|\Psi^-\rangle_{12}$. The input state $|\phi\rangle_1$
was always approximately one of the four ideal input states
$|\phi_{\mathrm{ideal}}\rangle \in \{|H\rangle, |V\rangle, |P\rangle = (|H\rangle + |V\rangle)/\sqrt{2}, |L\rangle = (|H\rangle - i|V\rangle)/\sqrt{2}\}$. We
performed tomographic measurements on three consecutive nights,
thereby accumulating data over 6.5 h. Figure 2 shows the state tomography
results of quantum teleportation. The measured density matrix ρ
for each of these teleported states was reconstructed from the experimentally
obtained data using the maximum-likelihood technique³⁰. The
fidelity of the teleported state is defined as the overlap of the ideal teleported
state $|\phi_{\mathrm{ideal}}\rangle$ with the measured density matrix: $f = \langle\phi_{\mathrm{ideal}}|\rho|\phi_{\mathrm{ideal}}\rangle$.
For this set of states, the teleported state fidelities are measured to be
f = 0.890(42), 0.865(46), 0.845(27) and 0.852(37), yielding an average
$\bar{f}$ = 0.863(38), where digits in parentheses represent 1σ uncertainties,
for example, $\bar{f}$ = 0.863 ± 0.038. During these measurements the link
attenuation varied from 28.1 dB to 39.0 dB, which was mainly caused
by rapid temperature change and strong wind. Despite such high loss in
the quantum free-space channel, the classical average fidelity limit⁹ of
2/3 was clearly surpassed by our observed fidelities, as shown in Fig. 3.
(Note that our random sampling of the input states over the mutually
unbiased basis states leads to the same classical limit as sampling over
the whole Bloch sphere. Also, it does not matter whether the sampling is
done over all six mutually unbiased basis states or over three states with
one state per basis.) Therefore, we have explicitly demonstrated
quantum teleportation over the 143-km free-space channel.
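
As a worked illustration of the state fidelity $f = \langle\phi_{\mathrm{ideal}}|\rho|\phi_{\mathrm{ideal}}\rangle$ and of the average quoted in the text, the sketch below evaluates the overlap for a hypothetical noisy density matrix and then averages the four reported fidelities. The white-noise model and the visibility value are assumptions made for the example; they are not the measured density matrices.

```python
import numpy as np


def state_fidelity(ideal, rho):
    """f = <ideal| rho |ideal> for a pure ideal state and a density matrix."""
    ideal = np.asarray(ideal, dtype=complex)
    return float(np.real(np.vdot(ideal, rho @ ideal)))


# Ideal |P> = (|H> + |V>)/sqrt(2)
P = np.array([1, 1], dtype=complex) / np.sqrt(2)

# Hypothetical output: ideal state mixed with white noise (illustration only)
visibility = 0.7
rho = visibility * np.outer(P, P.conj()) + (1 - visibility) * np.eye(2) / 2
f_example = state_fidelity(P, rho)  # (1 + visibility) / 2 for this model

# Fidelities reported in the text for |H>, |V>, |P>, |L> (no feed-forward)
reported = [0.890, 0.865, 0.845, 0.852]
mean_fidelity = sum(reported) / len(reported)
classical_limit = 2 / 3

print(f_example, round(mean_fidelity, 3), mean_fidelity > classical_limit)
```

The arithmetic mean of the four reported values reproduces the quoted average of 0.863, which lies above the classical limit of 2/3.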


13 SEPTEMBER 2012 | VOL 489 | NATURE | 271 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


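[Figure 3 plot: bar chart of state fidelity (vertical axis, 0.0 to 1.0) against input states $|H\rangle$, $|V\rangle$, $|P\rangle$, $|L\rangle$, $|P\rangle_{\mathrm{FF}}$ and $|R\rangle_{\mathrm{FF}}$ (horizontal axis), with the classical limit marked at 66.7%.]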


Figure 3 | Summary of the state fidelity results for the teleported quantum 
states with and without feed-forward. Grey bars show the results obtained in 
quantum teleportation without feed-forward. Bob was informed via an internet
connection when Alice’s BSM outcome was $|\Psi^-\rangle_{12}$, that is, those cases where
he is not required to apply any operation to his photon. The uncertainties of
these state fidelities are the same as presented in the main text. Red bars stand
for the results obtained in quantum teleportation with feed-forward (FF). Bob
was informed via the classical free-space link of Alice’s BSM outcomes of
$|\Psi^+\rangle_{12}$. The uncertainties of these state fidelities are derived from Poissonian
statistics. All observed fidelities significantly exceed the classical average fidelity
limit of 2/3 (66.7%). Error bars, ±1σ.


The reconstructed density matrices of the teleported quantum states
allow us to fully characterize the teleportation procedure by quantum
process tomography. The four input states ($\rho_{\mathrm{ideal}} = |\phi_{\mathrm{ideal}}\rangle\langle\phi_{\mathrm{ideal}}| =
|H\rangle\langle H|, |V\rangle\langle V|, |P\rangle\langle P|, |L\rangle\langle L|$) are transferred to the corresponding
(reconstructed) output states ρ. We can completely describe the effect of
teleportation on the input states $\rho_{\mathrm{ideal}}$ by determining the process
matrix χ, defined by $\rho = \sum_{l,k=0}^{3} \chi_{lk}\,\sigma_l\,\rho_{\mathrm{ideal}}\,\sigma_k$, where the $\sigma_i$ are the
Pauli matrices with $\sigma_0$ the identity operator. The process matrix χ
can be computed analytically from these four equations for the four
different input and output states*. The ideal process matrix of quantum
teleportation $\chi_{\mathrm{ideal}}$ has only one non-zero component, $(\chi_{\mathrm{ideal}})_{00} = 1$,
meaning the input state is teleported without any reduction in quality.
Figure 4a and b show respectively the real and imaginary components of
χ for quantum teleportation based on our experimental results. The
process fidelity of our experiment was $f_{\mathrm{process}} = \mathrm{Tr}(\chi_{\mathrm{ideal}}\chi) = 0.710(42)$.
This clearly confirmed the quantum nature of our teleportation experiment
as it is five standard deviations above the maximum process
fidelity of 0.5, which is the limit one can reach with a classical strategy
where Alice and Bob do not share any entanglement as a resource.

Figure 4 | Quantum process tomography of quantum teleportation without
feed-forward. a, b, The real (Re($\chi_{lk}$); a) and imaginary (Im($\chi_{lk}$); b) values of the
components of the reconstructed quantum process matrix, with l, k = 0, 1, 2,
and 3. The results of the state tomography of the four teleported states, $|H\rangle$,
$|V\rangle$, $|P\rangle$, $|L\rangle$, are employed to reconstruct the process matrix of quantum state
teleportation. The operators $\sigma_i$ are the identity (i = 0) and the x-, y-, and z-Pauli
matrices (i = 1, 2, 3). For the ideal case, the only non-zero component of the
process matrix of quantum teleportation, $\chi_{\mathrm{ideal}}$, is $(\chi_{\mathrm{ideal}})_{00} = 1$, which is
indicated by the wire grid. Our experiment clearly confirmed that $\chi_{00}$ (identity
operation) is indeed the dominant component and surpasses the classical limit.

In the second stage of the experiment, we realized quantum teleportation
including real-time feed-forward of the BSM result over the 143-km
classical channel. We set $|P\rangle$ and $|R\rangle$ states ($|R\rangle = (|H\rangle + i|V\rangle)/\sqrt{2}$) as
input states for which the required phase shift between the $|H\rangle$ and the
$|V\rangle$ components of photon 3 resulted in a 90° polarization rotation.
However, for the $|H\rangle$ or $|V\rangle$ input state, feed-forward is irrelevant
because a π phase shift would only result in a non-detectable global
phase shift. Thus, the quality of teleportation of these states is already
confirmed by our first-stage experiment. Realizing teleportation for the
states $|P\rangle$ and $|R\rangle$ fully confirms the generality of the procedure, as these
states belong to different mutually unbiased bases. In Tenerife, we
analysed photon 3 in the eigenbasis of the input state, that is, the $|P\rangle$/$|M\rangle$
($|R\rangle$/$|L\rangle$) basis when the input state was $|P\rangle$ ($|R\rangle$). Here
$|M\rangle = (|H\rangle - |V\rangle)/\sqrt{2}$. The resultant fidelities of the teleported states
are 0.760(50) and 0.800(37) for $|P\rangle$ and $|R\rangle$, respectively (red bars in
Fig. 3). Both results are clearly above the classical fidelity bound. Note
that in our experiment, the efficiency of the classical link was measured
to be 21.3%, which was mainly due to amplitude fluctuations caused by
atmospheric turbulence.

Using the real-time feed-forward operation, we unambiguously
experimentally demonstrated quantum teleportation from La Palma
to Tenerife over a 143-km free-space channel. We note that from a
conceptual perspective, real-time feed-forward is part of the original
teleportation proposal⁵. Ultimately, the advantage of long-distance
teleportation compared to just sending the quantum state itself may
lie in the following future applications: if Alice and Bob can stockpile
their entangled states beforehand (with the help of quantum memories),
teleportation is advantageous if the quantum channel is of low quality or
if Bob’s location is unknown to Alice. This is because Alice can broadcast
the classical information with high quality and to arbitrary, or even
unknown, locations’. (Also, the quantum repeater, which is of high
importance for large-scale quantum networks, is based on teleportation
in the form of entanglement swapping³¹.) Specifically, this advantage
also shows the usefulness of quantum teleportation even for quantum
channels that allow only small transmission rates. We note that over the
years teleportation sources have been markedly improved. We believe
this development will continue and currently several new schemes are
being pursued.

Our work proves the feasibility of both ground-based and satellite-based
free-space quantum teleportation. Our quantum teleportation
set-up was able to achieve coincidence production rates and fidelities
sufficient to cope with the optical link attenuation resulting from
various experimental and technical challenges; such challenges will
arise in quantum transmission between a ground-based transmitter
and a low-Earth-orbiting satellite receiver³². In fact, some of the
demands of our experiment were even more challenging than in 
satellite communication since there the atmospheric distances to be 
overcome are certainly shorter than the distance between Tenerife and 
La Palma. In addition, acquiring, pointing and tracking (APT) for 
diffraction-limited transmitter and receiver telescopes in space is a 
well-established technology. Therefore, our experiment represents a 
crucial step towards future quantum networks in space, which require 
space-to-ground quantum communication. During the finalization of 
our manuscript, a related work was reported³³ that used a 150-Hz APT
system, while our APT system operated at a few hertz, yet fast enough to 
fulfil the specific requirements of our experiment. The most significant 
distinction between ref. 33 and the experiment presented in this Letter 
is our implementation of an active feed-forward technique. The 
technology implemented in both experiments has certainly reached 
the required maturity for both satellite and long-distance ground com- 
munication. We expect that many of the features implemented here will 
be important building blocks for a new area of experiments. 
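
As a cross-check on the process-tomography figures quoted earlier, the short sketch below reproduces the "five standard deviations" statement and relates the classical process-fidelity bound of 0.5 to the classical state-fidelity limit of 2/3. The conversion uses the standard relation between average state fidelity and process fidelity for a d-dimensional system, $F_{\mathrm{avg}} = (d\,F_{\mathrm{process}} + 1)/(d + 1)$; it is brought in here as an assumption for illustration and is not taken from the text.

```python
def average_state_fidelity(process_fidelity, dim=2):
    """Standard relation F_avg = (d * F_process + 1) / (d + 1)."""
    return (dim * process_fidelity + 1) / (dim + 1)


# Values quoted in the text: f_process = 0.710(42), classical bound 0.5
f_process, sigma = 0.710, 0.042
classical_process_bound = 0.5

# Distance of the measured process fidelity from the classical bound
n_sigma = (f_process - classical_process_bound) / sigma

# The classical process bound maps onto the classical state-fidelity limit
classical_state_limit = average_state_fidelity(classical_process_bound)

# Average state fidelity implied by the measured process fidelity
# (an indicative number from the relation above, not a reported result)
implied_average = average_state_fidelity(f_process)

print(round(n_sigma, 2), classical_state_limit, round(implied_average, 3))
```

For a qubit (d = 2), a process fidelity of 0.5 corresponds to an average state fidelity of 2/3, so the two classical limits quoted in the text are consistent with each other.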


Received 21 May; accepted 14 August 2012. 
Published online 5 September 2012. 


1. Kimble, H. J. The quantum internet. Nature 453, 1023-1030 (2008).
2. Nielsen, M. & Chuang, I. Quantum Computation and Quantum Information
(Cambridge Univ. Press, 2000).
3. Gisin, N. & Thew, R. Quantum communication. Nature Photon. 1, 165-171 (2007).
4. Ladd, T. D. et al. Quantum computers. Nature 464, 45-53 (2010).
5. Bennett, C. H. et al. Teleporting an unknown quantum state via dual classical and
Einstein-Podolsky-Rosen channels. Phys. Rev. Lett. 70, 1895-1899 (1993).
6. Bouwmeester, D. et al. Experimental quantum teleportation. Nature 390, 575-579
(1997).
7. Boschi, D., Branca, S., De Martini, F., Hardy, L. & Popescu, S. Experimental
realization of teleporting an unknown pure quantum state via dual classical and
Einstein-Podolsky-Rosen channels. Phys. Rev. Lett. 80, 1121-1125 (1998).
8. Furusawa, A. et al. Unconditional quantum teleportation. Science 282, 706-709
(1998).
9. Massar, S. & Popescu, S. Optimal extraction of information from finite quantum
ensembles. Phys. Rev. Lett. 74, 1259-1263 (1995).
10. Hughes, R. J. et al. Free-space quantum key distribution in daylight. J. Mod. Opt. 47,
549-562 (2000).
11. Rarity, J. G., Tapster, P. R., Gorman, P. M. & Knight, P. Ground to satellite secure key
exchange using quantum cryptography. N. J. Phys. 4, 82 (2002).
12. Aspelmeyer, M. et al. Long-distance free-space distribution of quantum
entanglement. Science 301, 621-623 (2003).
13. Marcikic, I., Lamas-Linares, A. & Kurtsiefer, C. Free-space quantum key distribution
with entangled photons. Appl. Phys. Lett. 89, 101122 (2006).
14. Ursin, R. et al. Entanglement-based quantum communication over 144 km. Nature
Phys. 3, 481-486 (2007).
15. Villoresi, P. et al. Experimental verification of the feasibility of a quantum channel
between space and Earth. N. J. Phys. 10, 033038 (2008).
16. Fedrizzi, A. et al. High-fidelity transmission of entanglement over a high-loss
free-space channel. Nature Phys. 5, 389-392 (2009).
17. Scheidl, T. et al. Feasibility of 300 km quantum key distribution with entangled
states. N. J. Phys. 11, 085002 (2009).
18. Jin, X.-M. et al. Experimental free-space quantum teleportation. Nature Photon. 4,
376-381 (2010).
19. Scheidl, T. et al. Violation of local realism with freedom of choice. Proc. Natl Acad.
Sci. USA 107, 19708-19713 (2010).
20. Marcikic, I., de Riedmatten, H., Tittel, W., Zbinden, H. & Gisin, N. Long-distance
teleportation of qubits at telecommunication wavelengths. Nature 421, 509-513
(2003).
21. Ursin, R. et al. Quantum teleportation across the Danube. Nature 430, 849 (2004).
22. Briegel, H.-J., Dür, W., Cirac, J. I. & Zoller, P. Quantum repeaters: the role of
imperfect local operations in quantum communication. Phys. Rev. Lett. 81,
5932-5935 (1998).
23. Einstein, A., Podolsky, B. & Rosen, N. Can quantum-mechanical description of
physical reality be considered complete? Phys. Rev. 47, 777-780 (1935).
24. Kwiat, P. G. et al. New high-intensity source of polarization-entangled photon pairs.
Phys. Rev. Lett. 75, 4337-4341 (1995).
25. Calsamiglia, J. & Lütkenhaus, N. Maximum efficiency of a linear-optical Bell-state
analyzer. Appl. Phys. B 72, 67-71 (2001).
26. Kim, Y.-H., Kulik, S. P., Chekhova, M. V., Grice, W. P. & Shih, Y. Experimental
entanglement concentration and universal Bell-state synthesizer. Phys. Rev. A 67,
010301(R) (2003).
27. Poh, H. S., Lim, J., Marcikic, I., Lamas-Linares, A. & Kurtsiefer, C. Eliminating spectral
distinguishability in ultrafast spontaneous parametric down-conversion. Phys.
Rev. A 80, 043815 (2009).
28. Yao, X.-C. et al. Observation of eight-photon entanglement. Nature Photon. 6,
225-228 (2012).
29. Kim, Y.-S., Jeong, Y.-C., Sauge, S., Makarov, V. & Kim, Y.-H. Ultra-low noise
single-photon detector based on Si avalanche photodiode. Rev. Sci. Instrum. 82, 093110
(2011).
30. White, A. G., James, D. F. V., Eberhard, P. H. & Kwiat, P. G. Nonmaximally entangled
states: production, characterization, and utilization. Phys. Rev. Lett. 83,
3103-3107 (1999).
31. Zukowski, M., Zeilinger, A., Horne, M. A. & Ekert, A. K. “Event-ready-detectors”
Bell experiment via entanglement swapping. Phys. Rev. Lett. 71, 4287-4290
(1993).
32. Aspelmeyer, M., Jennewein, T., Pfennigbauer, M., Leeb, W. R. & Zeilinger, A.
Long-distance quantum communication with entangled photons using satellites. IEEE J.
Sel. Top. Quantum Electron. 9, 1541-1551 (2003).
33. Yin, J. et al. Quantum teleportation and entanglement distribution over
100-kilometre free-space channels. Nature 488, 185-188 (2012).

Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank the staff of IAC: F. Sanchez-Martinez, A. Alonso,
C. Warden, M. Serra and J. Carlos; and the staff of ING: M. Balcells, C. Benn, J. Rey,
O. Vaduvescu, A. Chopping, D. Gonzalez, S. Rodriguez, M. Abreu, L. Gonzalez; J. Kuusela,
E. Wille and Z. Sodnik; and J. Perdigues of the OGS and ESA. X.-S.M., T.J., R.U. and A.Z.
thank S. Ramelow for discussions, P. Kolenderski for discussions on the SPDC source 
with the Bell-state synthesizer, S. Zotter for help during the early stages of the 
experiment, and R. Steinacker for meteorological advice. J.K. was supported by the EU 
project MALICIA. E.A. and V.M. thank C. Kurtsiefer and Y.-S. Kim for detector electronics 
design, J. Skaar for support, and the Research Council of Norway (grant No. 180439/ 
V30) and Industry Canada for support. This work was made possible by grants from the 
European Space Agency (contract 4000104180/11/NL/AF), the Austrian Science 
Foundation (FWF) under projects SFB F4008 and CoQuS, and the FFG for the QTS 
project (no. 828316) within the ASAP 7 program. We also acknowledge support by the 
European Commission, grant Q-ESSENCE (no. 248095) and the John Templeton 
Foundation. 


Author Contributions X.-S.M. conceived the research, designed and carried out the 
experiment, and analysed data. T.H., T.S. and D.W. carried out the experiment and 
analysed data. S.K., W.N., B.W. and A.M. provided experimental assistance during the 
early stage of the experiment. J.K. provided the theoretical analysis and analysed data. 
E.A. and V.M. developed the ultra-low-noise detectors. T.J. provided experimental and 
conceptual assistance, and conceived and developed the coincidence analysis code. 
R.U. conceived the research, planned and carried out the experiment and analysed 
data. A.Z. defined the scientific goals, conceived the research, designed the experiment 
and supervised the project. X.-S.M., T.H., T.S., J.K., R.U. and A.Z. wrote the manuscript 
with assistance from all other co-authors. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to X.-S.M. (Xiaosong.Ma@Univie.ac.at)
or A.Z. (Anton.Zeilinger@Univie.ac.at).




of an appropriate promoter (Fig. 1d). 

Mascré and colleagues’ work sheds light on 
the heterogeneity of the proliferative cell popu- 
lations in mouse epidermis, and has increased 
our understanding of stem-cell biology. But 
does this knowledge apply to human tissues? 
Caution is required when inferring aspects 
of human tissue physiology from animal 
data. For instance, clonal analyses of several 
human squamous epithelia (tissues such as the 
epidermis, cornea and conjunctiva) have 
unambiguously shown the existence of 
self-renewing cells endowed with stem-cell 
properties, as well as non-self-renewing cells 
with differing capacities for multiplication, 
including canonical transient amplifying 
cells*”. Both cell types participate in the 
regeneration of these epithelia in the clinic’®.


The rainforest’s water pump 


An investigation of naturally occurring water recycling in rainforests finally 
marries the results of global climate models with observations. Alarmingly, it also 
suggests that deforestation can greatly reduce tropical rainfall. SEE LETTER P.282 


LUIZ E. O. C. ARAGÃO 


The humid tropics contain more than 
35% of global forests, covering an area 
of 11,564,000 square kilometres (ref. 1). 
Tropical trees can extract deep soil water and 
pump it back to the atmosphere through a 
process called evapotranspiration. As a result, 
25-56% of the rainfall in these regions can be 
recycled within the ecosystem’. Large-scale 
climate models indicate that this mechanism 
— which maintains atmospheric moisture and 
so feeds far-inland regions with rainfall — may 
be weakened by the removal of existing tropi- 
cal forests’. Yet, paradoxically, an increase in 
local rainfall over deforested areas has been 
observed’. 

On page 282 of this issue, Spracklen et al.° 
report an analysis of tropical rainfall that com- 
bines models of atmospheric transport with 
satellite observations of rainfall and vegetation 
cover*. The authors conclude that, for more 
than 60% of pan-tropical land (that is, the trop- 
ics across all continents), air masses that have 
travelled over extensive vegetated surfaces can 
generate at least twice as much rainfall as air 
masses that have flowed over deforested lands. 
On the basis of their findings, the authors predict 
a potentially widespread reduction of 
rainfall in the Amazon basin, if deforestation 
in the region were to continue at the rate’ that 
occurred from 1997 to 2002. 

*This article and the paper under discussion® were 
published online on 5 September 2012. 

Sustaining high rates of global human- 
population growth requires an increase in 
farmable land, energy and timber supplies. 
Without adequate planning, these needs 
tend to be met by cutting down rainforests. 
Because ecological processes and human 
activities (such as agriculture) depend on 
water provision, scientists have, for at least 
two decades, investigated the impact of tropi- 
cal deforestation on the hydrological cycle, 
especially in Amazonia. 

Pioneering studies** using global climate 
models (GCMs) found that vegetation removal 
can disrupt the water cycle. For example, 
models show that changes in land surface 
characteristics cause a reduction in evapo- 
transpiration rates and so induce basin-wide 
decreases in Amazonian rainfall’. Another 
simulation shows that removing around 
40% of the original forest cover could 
drive Amazonia into an irreversible, drier 
climate mode’. 

These findings, however, are at odds with 
observations. Studies indicate that rainfall over 
deforested areas is, in fact, higher than in adja- 
cent forests’. This is because increased surface 
heating in deforested areas induces upward 
air motion, reducing air pressure and drawing 
moist air from neighbouring forests into the 
openings. The moist air rises, and so generates 
convective rainfall over the openings. 

Figure 1 | Effect of deforestation on rainfall in the tropics. a, Much of the rainfall over tropical forests 
comes from water vapour that is carried by the atmosphere from elsewhere. But a large component 
is ‘recycled’ rain: water that is pumped by trees from soil into the atmosphere through a process 
called evapotranspiration’. Water exits from forests either as run-off into streams and rivers, or as 
evapotranspirated vapour that is carried away by the atmosphere. The atmospheric transport of water 
vapour into the forest is balanced by the exit of water in the form of vapour and run-off. b, Spracklen 
and colleagues’ analysis° suggests that deforestation reduces evapotranspiration and so inhibits water 
recycling. This decreases the amount of moisture carried away by the atmosphere, reducing rainfall in 
regions to which the moisture is transported. Decreasing evapotranspiration may also increase localized 
run-off and raise river levels. 

The problem for scientists is that models 
and observations capture processes at different 
scales, and so are not directly comparable. But 
with deforestation rates reaching 50,000 square 
kilometres per year (an area approximately 
the size of England) in the humid tropics’, 
resolving this issue is crucial for strategically 
planning the conservation of tropical biomes 
and for maintaining the well-being of human 
populations. 

Enter Spracklen et al., who have beautifully 
reconciled the scale of observations with that 
of models. Specifically, for each 1° × 1° grid 
cell across the pan-tropics, the authors have 
quantified the amount of rainfall generated by 
air masses that travelled over areas of varying 
leaf area index (LAI, the amount of leaf area 
per square metre of ground). The authors com- 
bined satellite observations of LAI and rainfall 
with the outputs of an atmospheric-transport 
model, which they used to estimate the trajec- 
tories of air masses in the 10 days before they 
arrived at each grid location. This analysis 
highlighted a clear connection between rainfall 
and the cumulative amount of LAI over which 
air masses travelled: for air masses exposed to 
low-to-medium vegetation cover, there is a 
nonlinear increase of 0.3 to 0.4 millimetres per 
day of rainfall for every additional LAI unit, 
with a subsequent saturation of rainfall over 
densely vegetated surfaces. 
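
As a rough illustration of the bookkeeping described above, the following Python sketch sums the LAI sampled along a back-trajectory and compares mean rainfall for air masses with low and high cumulative exposure. It is not the authors' code: the function names, the once-a-day sampling step, the threshold and every number are invented for illustration only.

```python
from statistics import mean


def cumulative_lai(lai_along_path):
    """Sum the leaf area index sampled at each step of a back-trajectory."""
    return sum(lai_along_path)


def mean_rain_by_exposure(records, threshold):
    """Split (exposure, rainfall) pairs at a threshold and average each group."""
    low = [rain for exposure, rain in records if exposure < threshold]
    high = [rain for exposure, rain in records if exposure >= threshold]
    return mean(low), mean(high)


# Hypothetical air masses: LAI sampled once per day over the 10 days before arrival.
paths = {
    "over_forest": [5, 5, 4, 5, 5, 4, 5, 5, 5, 4],
    "mixed": [5, 2, 1, 4, 0, 3, 1, 2, 4, 1],
    "over_pasture": [1, 0, 1, 1, 0, 1, 0, 1, 1, 0],
}
# Invented rainfall at the arrival grid cell, in millimetres per day.
rain_mm_per_day = {"over_forest": 8.0, "mixed": 5.0, "over_pasture": 3.0}

records = [(cumulative_lai(path), rain_mm_per_day[name]) for name, path in paths.items()]
low_mean, high_mean = mean_rain_by_exposure(records, threshold=20)
print(low_mean, high_mean)
```

A real analysis would bin many thousands of trajectories per grid cell rather than split three toy cases at a single threshold, but the quantity being accumulated is the same.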

Contrary to previous observations, Spracklen 
and colleagues’ results corroborate the large- 
scale rainfall-driving mechanisms described 
by GCMs. Simply put, densely vegetated sur- 
faces recycle water efficiently through evapo- 
transpiration (which averages around 3-4 
millimetres per day in Amazonian forests’), 
maintaining the specific humidity of the air 
masses over these regions (Fig. 1a). Air travelling 
over sparsely vegetated surfaces, however, 
loses moisture during continental transport 
because of reduced water recycling (Fig. 1b). 
Using a scenario of Amazonian deforestation’ 
for 2050 that assumes that the environmental 
policies of the early 2000s are continued into 
the future, the authors go on to demonstrate 
that forest loss is likely to reduce basin-wide 
rainfall by 12% during wet seasons, and by 21% 
during dry seasons. 

The projected decline in rainfall caused 
by deforestation in eastern and southern 
Amazonia (see Fig. 4 of the paper®) overlaps 
regions that, according to GCMs, already 
have a high probability of increased drought 
frequency by the end of this century, assuming 
that global temperature rises by about 3 °C 
(ref. 10). The potential ecological and economic 
impacts of this overlap could be huge. 
Changes in regional climate could exacerbate 
drought-related tree mortality, which in turn 
would reduce carbon stocks, increase fire 
risk and lower biodiversity¹¹. Such changes 
might also directly threaten agriculture, 
which generates US$15 billion per year in 
Amazonia”, and the hydropower industry, 
which supplies 65% of Brazil’s electricity”. 
Society should therefore take urgent action 
now, to curb tropical deforestation and avert 
future environmental problems. 

That said, Brazil is committed by its National 
Plan on Climate Change to cut deforestation 
rates by 80%, relative to historical levels, by the year 2020 
(ref. 14), and so the scenario used by Spracklen 
et al. for their prediction is likely to over- 
estimate the extent and magnitude of rainfall 
reduction in the mid-twenty-first century. An 
explicit consideration of temporal changes in 
atmospheric circulation and vegetation char- 
acteristics is also needed to refine projections 
of rainfall. And to provide an independent test 
of the patterns proposed by Spracklen et al., 
current pan-tropical rainfall trends should be 
quantified. 

Nevertheless, the authors have presented 
the first observational assessment of the influ- 
ence of vegetation on tropical rainfall patterns 
at a spatial resolution and coverage compat- 
ible with the outputs of GCMs. Their cutting- 
edge methodology will allow observations to 
be used consistently to examine large-scale 
deforestation impacts on rainfall, and to 
refine and evaluate current models to support 
conservation planning in the tropics. 

Diversity, stability and resilience of the human gut microbiota 


Catherine A. Lozupone, Jesse I. Stombaugh, Jeffrey I. Gordon, Janet K. Jansson & Rob Knight 


Trillions of microbes inhabit the human intestine, forming a complex ecological community that influences normal physi- 
ology and susceptibility to disease through its collective metabolic activities and host interactions. Understanding the 
factors that underlie changes in the composition and function of the gut microbiota will aid in the design of therapies that 
target it. This goal is formidable. The gut microbiota is immensely diverse, varies between individuals and can fluctuate 
over time — especially during disease and early development. Viewing the microbiota from an ecological perspective could 
provide insight into how to promote health by targeting this microbial community in clinical treatments. 


Most gut microbes are either harmless or of benefit to the 
host. The gut microbiota protects against enteropathogens’”, 
extracts nutrients and energy from our diets*“, and 
contributes to normal immune function’. Disruptions to the normal 
balance between the gut microbiota and the host have been associated 
with obesity*”, malnutrition’, inflammatory bowel disease (IBD)*"°, 
neurological disorders” and cancer’”. Understanding how the gut 
microbiota affects health and disease requires a shift in focus from 
individual pathogens to an ecological approach that considers the 
community as a whole. 

The first step in understanding the symbiotic relationship between 
gut microbes and their host is to characterize the baseline healthy 
microbiota and the differences that are associated with disease. 
Large-scale projects such as the European Metagenomics of the 
Human Intestinal Tract (MetaHIT)” and the US Human Microbiome 
Project (HMP)"* have made substantial progress towards this goal. 
Once the healthy composition and functional states of gut micro- 
biota are understood, the features that, when disrupted, are associ- 
ated with disease can be determined. However, the complexity of 
the microbiota and the variation between and within individuals 
complicate the definition of what this ideal state may be within a 
population or an individual. 

Ecological principles can aid in understanding the host-microbe 
interactions and specific functions of the gut microbiota. Improve- 
ments in sequencing methods and technologies such as proteomics 
and metabolomics, coupled with metabolic network modelling’*”®, 
show how host and environmental factors can affect gut microbial 
ecology over a lifetime. The composition, diversity and function of 
gut microbial communities have the potential to inform personalized 
nutritional and drug treatment strategies (Fig. 1). In this Review, we 
summarize the progress that has been made towards characterizing 
the diversity and function of microbial communities in the healthy 
human gut, describe the ways in which this ecosystem can go awry 
and discuss the prospects for restoring a degraded ecosystem. 


Taxonomic diversity in the healthy gut 

Until the advent of higher-throughput sequencing, even basic questions 
(such as how many different species there are, how diverse 
their genetic make-up is, which features are ubiquitous and which 
are unique to an individual, and whether community function can be 
predicted from species present) remained unanswered. 

Culture-based studies suggest that all healthy adults share most 
of the same gut bacterial species — a core microbiota. For example, 
Escherichia coli can be isolated from most people. However, culture- 
independent sequencing studies (Fig. 2) have demonstrated a vast 
microbial diversity that is highly variable over time and across pop- 
ulations. Each of us harbours more than 1,000 ‘species-level’ phy- 
lotypes: clusters of sequences that have as much diversity in their 
small subunit ribosomal RNA genes as named species’. Most of 
these phylotypes are bacteria belonging to just a few phyla. In adults, 
Bacteroidetes and Firmicutes usually dominate the microbiota, 
whereas Actinobacteria, Proteobacteria and Verrucomicrobia, 
although found in many people, are generally minor constituents"® 
(Fig. 3). Methanogenic archaea (mainly Methanobrevibacter smithii), 
eukaryotes (mainly yeasts) and viruses (mainly phage) are also pre- 
sent’. Despite the consistency of these main components, their 
relative proportions and the species present vary markedly across 
individuals (Figs 3 and 4). Attempts to find a core set of species- 
level phylotypes in the adult microbiota have identified several main 
players, including Faecalibacterium prausnitzii, Roseburia intestinalis 
and Bacteroides uniformis”’, but in some individuals even these can 
represent less than 0.5% of the microbes present”. As the data set 
expands to include developing countries and a broader age range*”’, 
the concept that there is a core set of species in the microbiota is 
becoming more unlikely. 
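
To make the idea of a 'species-level' phylotype concrete, here is a minimal Python sketch of greedy clustering by sequence identity. It is an illustration only, not the method used in the studies cited: the 97% default is a widely used convention assumed here, the function names are hypothetical, and the toy sequences are invented and far shorter than real rRNA genes.

```python
def identity(a, b):
    """Fraction of matching positions between two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_phylotypes(seqs, threshold=0.97):
    """Put each sequence in the first cluster whose seed it matches at or above threshold."""
    seeds, clusters = [], []
    for seq in seqs:
        for i, seed in enumerate(seeds):
            if identity(seq, seed) >= threshold:
                clusters[i].append(seq)
                break
        else:
            # No existing seed is similar enough, so this sequence founds a new cluster.
            seeds.append(seq)
            clusters.append([seq])
    return clusters


# Toy 10-base 'reads'; a looser threshold is used because the sequences are so short.
reads = ["ACGTACGTAC", "ACGTACGTAA", "TTTTTTTTTT"]
clusters = greedy_phylotypes(reads, threshold=0.85)
print(len(clusters), clusters)
```

Greedy seeding depends on input order, which is one reason different pipelines can report different phylotype counts for the same sample.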


Functional diversity in the healthy gut 

Knowing the composition of the microbial community alone does 
not necessarily lead to an understanding of its function. Functional 
information comes, in part, from studying cultured isolates that have 
well-characterized genome content and ex vivo phenotypes, and from 
sequencing community DNA. Functional screening by shotgun 
metagenomics relies on the sequencing of the total microbial com- 
munity DNA, including from uncultured members, and matching 
the sequences to known functional genes (Fig. 2). Identifying the 
genes involved in specific metabolic pathways can predict functional 
capabilities, but without messenger RNA, protein and metabolite profiling 
these are still predictions. The process of mapping short gene 
sequence fragments to organisms and functions is improving as more 
human gut microbial genomes are sequenced and annotated”, and as 
more complementary ‘omics’ data sets become available”. 

Despite the highly divergent compositions of gut microbiota across 
individuals, the functional gene profiles are quite similar (Fig. 4). This 
similarity was reported in a study of 18 females who shared more 
than 93% of the enzyme-level functional groups, but few genus-level 
phylotypes”, and was confirmed in much larger populations in HMP 
and MetaHIT data’. 
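
One simple way to express 'shared by everyone' is the fraction of all observed features that occur in every individual. The Python sketch below applies that measure to taxa and to functional groups. How the cited study defined sharing is not stated here, so this definition is an assumption, and the genus lists and function labels are invented.

```python
def shared_fraction(feature_sets):
    """Fraction of all observed features that are present in every individual."""
    sets = [set(s) for s in feature_sets]
    if not sets:
        return 0.0
    union = set().union(*sets)
    core = set.intersection(*sets)
    return len(core) / len(union) if union else 0.0


# Invented genus lists for three people: only one genus is common to all.
genera = [
    {"Bacteroides", "Prevotella"},
    {"Bacteroides", "Roseburia"},
    {"Bacteroides", "Faecalibacterium"},
]
# Invented functional groups: almost everything is common to all.
functions = [
    {"glycolysis", "folate", "butyrate", "transport"},
    {"glycolysis", "folate", "butyrate", "transport"},
    {"glycolysis", "folate", "butyrate"},
]

print(shared_fraction(genera), shared_fraction(functions))
```

In this toy case the taxonomic core is a quarter of the observed genera while the functional core is three quarters of the observed functions, which mirrors the direction of the contrast described above.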

The gut microbiota all contain pathways for central, carbohydrate 
and amino-acid metabolism”. However, not all pathways are rep- 
resented, and grouping genes into broad functional categories can 
conceal the finer differences in function between individuals. Vari- 
able functions that are restricted to a species or strain — including 
pathogenicity islands, vitamin and drug catabolism, motility and 
nutrient transporters — are possible targets for personalized diets 
and therapeutic strategies. 

Many genes are expressed only under specific conditions. Shot- 
gun sequencing approaches that measure levels of mRNA (metatran- 
scriptomics) or shotgun proteomics (metaproteomics) may uncover 
functional variation with factors such as disease or diet that DNA 
studies overlook. For example, genes involved in carbohydrate 
metabolism and energy generation have been found to be expressed 
as proteins at higher levels than predicted from metagenomic data, 
underscoring the importance of these processes in the gut”. 

The observation that we share a functional core microbiome 
but not a core microbiota can be related to macroecosystems. For 
instance, rainforests that look the same and have many of the same 
functions can be composed of different species that have indepen- 
dently evolved. The challenge of ecological studies of the gut is to 
understand which members of the community have similar func- 
tional niches and which ones can be substituted for one another. 
Although taxonomic composition is far more variable than func- 
tional gene composition, at a general process level at least, several 
studies have shown a correlation between the two*'*”*. Functional 
gene composition variation that is not well explained by taxonomic 
variation can indicate that phylogenetically unrelated taxa have 
converged on similar functions and may be crucial for identifying 
and understanding functional components associated with altered 
physiological states (Fig. 2). 


Factors that drive normal variation 

Having established that the healthy gut microbiota is highly variable, 
we must next understand why it varies, so that this information can 
be used to tailor therapies or clinical trials. For example, the extent 
to which members of the same family harbour similar microbes will 
determine whether the family history of microbiota-driven disease is 
informative. How much the microbiota varies with age or pregnancy 
should be taken into account when designing cohorts. The sensitiv- 
ity of the microbiota to external factors, such as diet, will inform 
strategies for treating microbiome-linked diseases. 


Age 

Marked changes in the gut microbiota occur early on in life, with an 
increase in diversity and stability over the first three years***”’ (Fig. 5). 
The maturation of the human microbiota is an example of ecological 
succession*”*”’. After the initial colonization, communities undergo 
consecutive changes in composition and function until a relatively stable 
climax community is established. 
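
Diversity can be quantified in several ways. As one hedged example, the Python sketch below computes the Shannon index from phylotype counts. The text does not say which index the cited studies used, so the choice of Shannon is an assumption, and the counts are invented.

```python
import math


def shannon_index(counts):
    """Shannon diversity H = -sum(p * ln p) over phylotypes with non-zero counts."""
    total = sum(counts)
    if total == 0:
        raise ValueError("at least one count must be non-zero")
    proportions = [c / total for c in counts if c > 0]
    return -sum(p * math.log(p) for p in proportions)


# Invented counts: a community dominated by one phylotype and an even four-member one.
dominated = [97, 1, 1, 1]
even = [25, 25, 25, 25]

print(shannon_index(dominated), shannon_index(even))
```

The index rises both when more phylotypes are present and when their abundances are more even, so a succession towards a richer, more balanced community would increase it on both counts.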

The microbiota in infants is relatively volatile. Variation in 
microbial communities and functional gene repertoires between 
individuals is greater in infants than in adults. This has been seen in 
rural communities in Malawi and Venezuela, and in US metropoli- 
tan populations of European and African descent*. However, infant 
microbiomes do share characteristic properties across individuals 
and populations, both compositional (many bifidobacteria, and 
lower species richness than in adults) and functional (more genes 
that encode enzymes involved in folate biosynthesis’). 

Figure 1 | Maintaining our gut microbial lawn. In some ways, maintaining 
a healthy microbiota is like lawn care: severe interventions can take the 
ecosystem back to bare earth, so it has to be re-established. Although many 
people recover naturally, this is not guaranteed, and weed-like species that 
are adapted to perturbed ecosystems often run wild. Several strategies can 
be used to restore the ecosystem: reseeding with a few well-defined ‘good’ 
microbes (probiotics), adding compounds that are thought to specifically 
promote the growth of beneficial microbes (prebiotics) and transplanting 
the entire microbial ecosystem, for example from a stool sample (faecal 
bacteriotherapy). The weed stage could be bypassed by administering the 
treatment immediately after the disturbance before the weeds can establish 
themselves. An additional strategy (not shown) is to use specific drugs, such 
as narrow-spectrum antibiotics, that target undesirable members of the 
microbial community. Although we are beginning to learn what a healthy 
microbial community looks like, and to recognize signs of weed species, our 
understanding of which strategies for altering the microbiota work best, and 
predicting which will work for a given individual, is still in its infancy. 

The microbiota of infants is affected by antibiotic use, whether 
they were breastfed and the method of delivery, although whether 
differences in the microbiota in early life affect the composition of the 
adult microbiota is not well understood”*”. However, differences in 
composition driven by factors in infancy may affect susceptibility to 
immunological diseases into adulthood, such as asthma and atopic 
diseases”’. A possible mechanism for this susceptibility has been dem- 
onstrated in germ-free mice that accumulate invariant natural-killer 
T (iNKT) cells in the colonic lamina propria and lungs. Colonization 
with a conventional microbiota protected the mice from the pathology 
associated with this accumulation when it occurred in neonatal, but not 
adult, germ-free mice, indicating that infancy is a crucial 
time for contact with the microbiota’. Identifying the components 
of the human microbiota that shape our immune system in early life 
is now needed. 

Figure 2 | Tools for evaluating microbiota diversity. Targeted sequencing 
of phylogenetically informative genes (usually small subunit (SSU) ribosomal 
RNA) or random sequencing of all genes (shotgun metagenomic sequencing) 
can be used to assess DNA extracted from faecal samples. Genome sequences 
from cultured isolates link these two data sets by indicating which species 
contain which genes, and therefore functions. Shotgun metagenomic data 
will become substantially more useful as the number of reference genomes 
continues to increase with additional strain sequencing efforts. SSU rRNA 
gene sequences are related to each other in the form of phylogenetic trees 
because related phylotypes (clusters of similar sequences defined by sequence 
similarity) generally have more similar functional attributes. Genes that 
encode proteins that perform known enzymatic reactions are related using 
metabolic networks, because genes that participate in the same metabolic 
pathway can work together to produce a phenotype. Data from SSU rRNA 
targeted gene and shotgun metagenomic sequencing can be analysed by the 
same suite of techniques (detailed in the table and with illustrations to the 
right). Combining analysis results from targeted and shotgun sequencing data 
from the same samples, and sometimes using genome sequences to relate the 
two, can provide additional insight (middle column). Combining targeted 
sequencing with mRNA, protein and metabolite level analyses will be crucial 
for moving from in silico predictions to direct measurements of expressed 
community properties. OTU, operational taxonomic unit. 

Genetics, environment and diet 

How factors such as the environment and diet shape the human gut 
microbiota is still unclear, partly because these factors are often con- 
founded. Twins and mother-daughter pairs have more similar microbiota 
compositions than unrelated individuals, suggesting that there could be 
a genetic influence over the microbiota””’. However, monozygotic and 
dizygotic adult twins have equally similar microbiota, suggesting envi- 
ronment rather than genetics may drive familial similarities”. 

Populations can be separated by characteristic differences in 
the gut microbiota. For example, Italian children have a different 
microbiota from children from rural Africa*’, and both children and 
adults from the United States have a very different microbiota from 
populations in Malawi and the Amazonas state of Venezuela‘ (Fig. 5). 
Although genetically different, these populations also differ in other 
factors that could affect the microbiota, such as environmental expo- 
sures, provision of adequate sanitation and levels of cleanliness, diet 
and antibiotic use. Cultural factors, especially diet, may be crucial in 
shaping the gut microbiota. The ratio of two of the main genera of gut 
bacteria, Prevotella and Bacteroides, correlates well with the overall 
pattern of diversity across healthy adults (see discussion of entero- 
types later)*'*’. Levels of Prevotella were enriched in children from 
a rural African village who had a high-fibre diet*’, and in children 
and adults from Malawi and Venezuela whose diet was dominated 
by plant-derived polysaccharide foods such as maize and cassava. 
By contrast, the microbiota of those in the United States had more 
Bacteroides*. For healthy adults in the United States, differences in 
long-term diet between individuals were also correlated with these 
bacteria. Bacteroides was associated with a long-term diet rich in 
animal protein, several amino acids and saturated fats, and Prevotella 
was associated with carbohydrates and simple sugars”. The relative 
importance of changes in the microbiota as a result of diet, com- 
pared with other factors that affect health, remains a subject of active 
investigation. 
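
The Prevotella-to-Bacteroides ratio mentioned above is often easier to compare across samples on a log scale. The Python sketch below shows one way to compute it from read counts. The pseudocount, the function name and the counts are assumptions made for illustration and are not taken from the studies discussed.

```python
import math


def log10_ratio(counts, numerator="Prevotella", denominator="Bacteroides", pseudocount=1):
    """log10 of (numerator + pseudocount) / (denominator + pseudocount) for one sample."""
    top = counts.get(numerator, 0) + pseudocount
    bottom = counts.get(denominator, 0) + pseudocount
    return math.log10(top / bottom)


# Invented genus-level read counts for two hypothetical samples.
plant_rich_diet = {"Prevotella": 999, "Bacteroides": 9}
animal_rich_diet = {"Prevotella": 9, "Bacteroides": 999}

print(log10_ratio(plant_rich_diet), log10_ratio(animal_rich_diet))
```

The pseudocount keeps the ratio defined when one genus is absent from a sample; its size is a modelling choice that matters most for shallowly sequenced samples.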

Understanding how cultural traditions affect the microbiota should 
highlight the factors that result in the marked differences in the incidence 
of diseases associated with microbiota. For example, the incidence 
of IBD and allergies is greater in industrialized Western 
societies than in agrarian cultures”. Understanding these associations 
will require expanded studies that sample a greater number of popula- 
tions and control the confounding factors. This may come in the form 
of a human microbiome diversity project that parallels the Human 
Genome Diversity Project™. Especially valuable will be populations 
in which variables that correlate with microbiota diversity — such as 
history of antibiotic use, diet and environmental exposures — can be 
measured. Comparisons with environmental exposures may require 
sampling from the environment as well as from individuals. Because 
exposure to a microbiota in early life is particularly relevant to the 
development of immunological disease’, prospective studies that 
include infants, or even prenatal studies, will be especially valuable. 
When determining disease aetiology, immigrant populations can pro- 
vide an opportunity to disentangle host genetics, geography and culture 
because the incidence of some diseases matches that present at the 
destination rather than at the origin”. 


Microbiota variability, and human health and medicine 

Understanding the compositional and functional differences in the 
gut microbiota can lay the foundation to relate these differences to 
human health. Differences in the microbiota and the microbiome 
could help to explain the variation in the gut metabolic processes of 
individuals, including the metabolism of drugs and food'’”*. Many 
of these metabolic pathways are outside the common functional core, 
so they can underlie host-specific responses. For example, the health 
benefits of diets rich in soya, such as improvements in vasomotor 
symptoms, osteoporosis, prostate cancer and cardiovascular disease, 
have been attributed to (S)-equol produced from the soya isoflavone 
daidzein by bacterial rather than human enzymes”. Only 25-30% of 
the adult population of Western countries produce (S)-equol with a 
diet of soya-rich foods, compared with 50-60% of adults from Japan, 
Korea or China**. The cancer-protective effects of soya beans that 
have been described in Asian populations may therefore not be true 
of Western populations. Similarly, the gut microbiota can indirectly 
determine whether the widely used analgesic paracetamol will be 
metabolized to paracetamol sulphate or paracetamol glucuronide, 
potentially altering its efficacy and toxicity’. The microbes medi- 
ate this metabolic phenotype by producing the compound p-cresol, 
which competes with paracetamol for human enzymes that catalyse 
sulphonation”®. Substantial differences in the gut microbiota between 
populations and concomitant effects on drug metabolism will neces- 
sitate population-specific drug trials for examining effectiveness and 
toxicity. Understanding how the microbiota varies across the popula- 
tion, and correlating this variability with specific microbial functions, 
is emerging as a component of personalized medicine. 


Stable configurations of healthy microbiota 

The landscape of stable states for the human gut is still unknown. Sam- 
ples obtained over time from the same individual are more similar to 
one another than those obtained from different individuals, suggest- 
ing that each person has a relatively distinct, stable community”. 
This temporal stability could be related to the concept that stable 
equilibrium states exist for the microbiota, for which disturbances, 
stochasticity and temporal dynamics of individual microbes produce 
change, but the community is still drawn to a central attractor. 
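
The comparison behind this observation can be sketched in a few lines of Python: compute a dissimilarity between every pair of samples, then average the pairs from the same person separately from the pairs from different people. Bray-Curtis dissimilarity is used here as an assumed, commonly used choice; the text does not say which measure the cited studies used, and the abundance vectors are invented.

```python
from itertools import combinations
from statistics import mean


def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two abundance vectors of equal length."""
    numerator = sum(abs(x - y) for x, y in zip(a, b))
    denominator = sum(a) + sum(b)
    return numerator / denominator if denominator else 0.0


def within_between(samples):
    """Mean dissimilarity for same-individual pairs and for different-individual pairs."""
    within, between = [], []
    for (id1, v1), (id2, v2) in combinations(samples, 2):
        (within if id1 == id2 else between).append(bray_curtis(v1, v2))
    return mean(within), mean(between)


# Invented counts of three taxa in two people, each sampled at two time points.
samples = [
    ("A", [10, 0, 5]),
    ("A", [9, 1, 5]),
    ("B", [0, 10, 5]),
    ("B", [1, 9, 5]),
]

within_mean, between_mean = within_between(samples)
print(within_mean, between_mean)
```

A within-person mean that is clearly smaller than the between-person mean is what 'a relatively distinct, stable community' looks like in this kind of summary.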

Clinically, it would be convenient if a few clearly differentiable 
stable states could be used to stratify the gut microbiome across the 
human population. For example, if microbes in one state were known 
to metabolize a drug into harmful metabolites, a simple test could 
avoid complications for the patient, as in the case of paracetamol and 
liver toxicity’®. It would be especially helpful if the same states could 
be applied across the board so that, for example, the same community 
features that correlated with the generation of toxic metabolites from 
a drug also correlated with obesity. 

Enterotypes are a proposed example of such states. The human microbiome has 
been described to form three distinct host-microbial symbiotic states 
driven by groups of co-occurring species and genera, which are char- 
acterized by a relatively high representation of Bacteroides, Prevotella 
or Ruminococcus*'. Although the overall clustering structure, which 
was based on 33 faecal shotgun metagenomes, was not statistically sig- 
nificant, the three enterotype clusters described the real data better than 
randomly generated data sets”’. Similar patterns were reported after 
existing 16S rRNA data from 154 individuals from the United States” 
and metagenomic data from 85 individuals from Denmark"? were reana- 
lysed. However, subsequent studies of 98 healthy adults from the United 
States*’, 531 healthy infants, children and adults from Malawi, Venezuela 
and the United States*, and 250 healthy adults from the United States’ 
failed to show the same pattern. Variation among adults in these popula- 
tions was, however, associated with a trade-off between Prevotella and 
Bacteroides, suggesting an important role for these taxa, or taxa that co- 
distribute with them, in structuring the microbiota. When infants were 
included in the analysis, variation was also associated with a trade-off 
between the infant-associated Bifidobacterium genus and lineages that 
were common only to adults’ (Fig. 5). The number of unique configurations 
that form functional, stable communities may be large and may not 
be easy to classify into a manageable number of distinct ‘types’. Considering 
this concept further in more extensive data sets made up of different 
ages, cultural traditions or geographic locations, and physiological or 
disease states will be important. 


Differences in the microbiota with disease 

The composition of the microbiota differs with the physiological state 
of the host. People who are obese harbour fewer types of microbes 
in their gut than people who are lean, and have a significantly differ- 
ent abundance of specific taxa and functional genes*’. Classification 


13 SEPTEMBER 2012 | VOL 489 | NATURE | 223 


© 2012 Macmillan Publishers Limited. All rights reserved 


REVIEW 


Ruminococcaceae 


Lachnospiraceae 


Figure 3 | Diversity of the human microbiota at different phylogenetic 
scales. The human microbiota is remarkably varied within and between 
individual hosts. Although this complexity can be simplified by evaluating 
communities at higher taxonomic levels, the species within each phylum 
have different biological properties, and significant changes detected at 
higher taxonomic levels are probably driven by only a subset of the species 
in those higher taxa. Using the 16S rRNA sequence data of four of the 
adults from the United States in ref. 4, the high diversity and variability 
among individuals, and the degree to which taxonomic grouping at high 
levels can mask this diversity are shown. Phylum-level diversity can vary
markedly even across healthy adults in the same population.
Individual A has an unusually high proportion of Bacteroidetes; individual
D has an unusually high proportion of Fusobacteria; and individuals B and 
C are dominated by Firmicutes and Bacteroidetes, which is the more typical 
phylum-level distribution for this cohort. However, even B and C differ
at finer scales. The tree depicts the phylogenetic relationships between
species-level phylotypes in just the most diverse of the phyla, Firmicutes, 
in individuals B and C. Branches specific to individual B are red; branches 
specific to individual C are blue; and shared branches are purple. Each 
individual has many unique phylotypes not found in the other. As 
described in many surveys of the human gut'*"*“°, the Ruminococcaceae 
and Lachnospiraceae families are particularly rich in phylotypes. 
of people as lean or obese can be made solely on the basis of their gut
microbiota with 90% accuracy**”, but they do not separate into distinct 
microbiota-based clusters on commonly used principal coordinates 
plots, which are used to identify statistical differences between groups. 
Thus, multiple statistical techniques are needed to show fully the differ- 
ences in the microbiota between different physiological states (Fig. 2). 

Some differences in the microbiota can contribute directly to disease 
states. Gnotobiotic mice that were raised germ-free then colonized with 
the microbiota from an obese mouse gained fat more rapidly than those 
colonized with the microbiota of a lean mouse”. A phenotype can 
emerge from different compositional backgrounds, which may indicate 
that specific components of the microbiota can exert large effects or that 
many different changes can lead to the same functional result. 

Differences in faecal microbial community diversity, composition and 
function have also been correlated with Crohn's disease’, ulcerative colitis’,
irritable bowel syndrome (IBS)*, Clostridium difficile-associated
disease (CDAD)” and acute diarrhoea*’. Sometimes, the nature of the 
microbiota deviation from health is consistent across individuals with 
the same disease. For instance, a twin study of IBD found marked and 
reproducible deviations in patients with ileal Crohn's disease relative to 
the controls, and more subtle, but characteristic, changes in patients with 
colonic Crohn’s disease”, and specific functional differences were also 
observed from metabolic profiling of the same samples”. Other diseases 
are associated with marked deviations from health that are inconsistent 
across individuals. For instance, individuals with recurrent CDAD had 
a phylum-level diversity that was very different from controls but not 
similar to each other”. Many disease studies are confounded by extensive 
use of treatments, such as antibiotics, that may obscure true disease-asso- 
ciated changes, highlighting the urgent need for prospective longitudinal 
studies that establish cause and effect. 


Parallels between host physiological states 

Studies of the microbiota often target one specific disease or state, 
but comparisons of the microbiota across many diseases can show 
common changes in the gut environment. Disturbed mucous lay- 
ers that line the intestinal cell wall and concomitant inflammation 
are seen in individuals with IBD, coeliac disease, HIV enteropathy, 
acute diarrhoea, diverticulosis, carcinoma and IBS”. Given these 
parallels, an increase or decrease in abundance of similar microbes 
across different disturbances might be expected”, but elucidation 
of these differences may require detailed biogeographical studies 
along the length of the gut — once safe and reliable means for such 
comprehensive sampling are developed. 

Perturbed adult gut microbial communities are intriguingly simi- 
lar to infant gut microbial communities. Both systems may represent 
successional communities in which the same opportunistic or fast-growing
species can predominate”. For instance, C. difficile is a
normal gut resident that can cause disease when antibiotics com- 
promise the stable adult gut communities; it also colonizes 2-65% 
of infants, although most infants are asymptomatic™””. Clostridium 
bolteae and Clostridium symbiosum are also associated with a dis- 
turbed gut and systemic infection, and are found in the infant gut®. 
The microbiota of individuals with ileal Crohn’s disease can also 
resemble that of infants: both have increased levels of Ruminococcus 
gnavus and Enterobacteriaceae in their stools, and an under-repre- 
sentation of the genera that are prevalent in healthy adults, including 
Faecalibacterium and Roseburia’’. These examples show the impor- 
tance of understanding whether generally opportunistic members of 
the gut microbiota have a selective advantage during early succession 
or disruption caused by disease, and therefore whether they are the 
side effects of disease rather than causal agents. 


Resilience of stable states 

Resilience is the amount of stress or perturbation that a system can 
tolerate before its trajectory changes towards a different equilib- 
rium state*’. Macroecosystem studies of human interference such as 
Figure 4 | Functional redundancy. The functional redundancy in microbial
ecosystems may mirror that in macroecosystems. As shown in the HMP data 
set’, oral communities (top panels) and faecal communities (bottom panels) 
analysed using 16S rRNA (coloured by microbial phyla, left panels) show 
tremendous abundance diversity. The same samples analysed by shotgun 
metagenomics (panels on right) have remarkably similar functional profiles. 
Reprinted with permission from ref. 14. 


resource exploitation, pollution, land-use change and global warm- 
ing have shown how ecological systems can be transformed into less 
productive or less desirable states**’. For example, in a tropical lake 
community, a regime in which submerged plants dominate is pre- 
ferred to one with extensive free-floating plant cover because dense 
mats of floating plants create anoxic conditions that reduce animal 
biomass and diversity. However, pollution can cause floating plants 
to predominate because they are better at competing for light and can 
exclude submerged plants when the nutrient load is high. By under- 
standing the environmental drivers of conversion between states, 
interventions can be used to induce a regime change. For example, 
harvesting the floating plants once can induce a permanent shift to 
the submerged-plant-dominant state, but only if the nutrient loading 
is not too high”. 
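
The harvesting example can be illustrated with a toy model of two alternative stable states. The Python sketch below is not taken from the cited lake studies. It assumes a hypothetical cubic growth law in which floating-plant cover x has stable states at 0 and 1 separated by an unstable tipping point, and it uses the `tipping_point` parameter as a stand-in for nutrient load (a lower tipping point representing heavier loading). The function name and all parameter values are illustrative assumptions.

```python
def relax(cover, tipping_point, dt=0.01, steps=10_000):
    """Euler-integrate dx/dt = x * (1 - x) * (x - tipping_point).

    x = 0 (submerged plants dominate) and x = 1 (floating plants dominate)
    are stable; x = tipping_point is the unstable boundary between them.
    """
    x = cover
    for _ in range(steps):
        x += dt * x * (1.0 - x) * (x - tipping_point)
    return x


# A single harvest cuts floating-plant cover from 1.0 to 0.2 in both scenarios.
after_harvest = 0.2

# Moderate nutrient load (assumed tipping point 0.4): the harvest pushes the
# system past the tipping point, so it settles in the submerged-plant state.
moderate = relax(after_harvest, tipping_point=0.4)

# High nutrient load (assumed tipping point 0.1): the same harvest is not
# enough, and floating plants grow back towards full cover.
high = relax(after_harvest, tipping_point=0.1)

print(f"moderate load: final floating cover = {moderate:.3f}")
print(f"high load:     final floating cover = {high:.3f}")
```

In this sketch, resilience corresponds to the distance between a stable state and the tipping point: the same one-off perturbation causes a lasting regime change in one scenario and only a transient dip in the other.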

Environmental studies can provide examples of the microbial 
response to perturbations, and perhaps an insight into how the gut 
microbiota might react. During the Deepwater Horizon oil spill in 
the Gulf of Mexico, the microbial community structure and func- 
tional gene repertoire in the deep-sea oil plume shifted, transiently, 
to an enrichment of microbes that were capable of hydrocarbon- 
degradation™””. This is similar to the effect of an extreme dietary 
change on the gut microbiota of mice that were moved from a low- 
fat, plant-rich diet to a high-fat, high-sugar diet’. In both cases, the 
Figure 5 | Human microbial diversity and enterotypes. Enterotypes” were
determined when evaluating only adults from the United States and Europe 
(circled in white). By including children from the United States and children 
and adults from developing countries, the picture of human-associated 
microbiota diversity greatly expands. The relationship between the microbiota 
of 531 healthy children and adults from Malawi, Amazonas state of Venezuela 
(Amerindians) and the United States was evaluated using sequences from
the 16S rRNA gene in faecal samples and a principal coordinates analysis
of unweighted UniFrac distances (adapted with permission from ref. 4). a,
Infants differentiate strongly from adults, and b, adults from the United States 
have a distinct composition from those of Malawi and Venezuela, indicating 
the diversity differences are mainly owing to age and culture. 


microbial communities shifted substantially, probably because of the 
exposure to new substrates that provided a selective advantage to 
certain members of the community. Whether a particular disturbance 
will disrupt a stable state depends on the resilience of the community 
to a type of perturbation. 


Dietary changes 

Understanding the resilience of the gut microbiota is crucial for 
determining the efficacy of therapeutic diets. Consuming a
carbohydrate- or fat-restricted low-calorie diet for a year’, or a high-fat and
low-fibre, or low-fat and high-fibre, diet for 10 days” can induce
statistically significant changes in the gut microbiota. However, 
these changes in species and gene content are small compared with 
the baseline variations that occur between individuals. Long-term 
dietary surveys and cross-cultural comparisons suggest that changes 
to diet might lead to regime changes over longer periods of time*”, 
perhaps by eroding the landscape of alternative stable states to allow 
changes that short-term ‘nudges’ cannot. 


Antibiotics 

Antibiotic administration can move the microbiota to alternative stable 
states. In healthy volunteers who were given two courses of ciprofloxacin 
in a 10-month period, the faecal microbiota reached a stable state simi- 
lar to, yet distinct from, the pretreatment state"’. However, the magni- 
tude of disturbance that occurred after ciprofloxacin treatment, and the 
speed and extent of recovery to the pre-ciprofloxacin state, suggested 
that resilience of the microbiota varies across individuals and between 
ciprofloxacin treatments in the same individual”. 

Long-term studies of the microbiota after antibiotic treatment indi- 
cate that post-antibiotic equilibrium states are themselves resilient. For 
example, clindamycin treatment affected Bacteroides in the gut for up to 2 
years after treatment had finished. Similarly, three individuals with dyspepsia
treated for 1 week with a combination of metronidazole, clarithromycin
and omeprazole had a shift in their microbiota state that persisted
for up to 4 years without additional antibiotic treatment’. In both cases, 
a significant increase in the antibiotic-resistance genes persisted for 
years”, suggesting that the post-disturbance state would probably be 
more resilient to the same disturbance again because a greater proportion 
of the microbiota would be resistant. This is consistent with a case from 
the study of healthy volunteers given the two courses of ciprofloxacin”. 
Initial recovery of the microbiota was slow and incomplete, stabilizing 
in a different interim state, but after a second treatment, recovery was 
relatively quick. But in the same study, a different case had the opposite 
response, in which an essentially complete recovery occurred after the
first ciprofloxacin treatment, but stabilized to a distinct state after the sec- 
ond treatment, suggesting that the initial antibiotic treatment decreased 
resilience”’, and that the resilience of the microbiota to future antibiotic 
treatments can vary considerably between individuals. 


New species invasion 

Resilience of the microbiota to challenge with exogenous microbes is also 
important. The gut microbiota generally shows colonization resistance, 
in which the native microbiota prohibits harmful (pathogenic)'’** and 
potentially beneficial (probiotic) microbes from establishing. Chal- 
lenging the gut microbiota with a foreign microbiota rather than with 
specific pathogens or probiotics has suggested the native community 
may be less resilient to colonization by exogenous microbes than previ- 
ously thought. When 14 conventionally raised Lewis rats were force fed 
with pooled microbiota from Sprague-Dawley and Wistar strain rats, 
the microbiota diversity of the Lewis rats increased substantially and 
changed to resemble that of the Sprague-Dawley and Wistar rats, and 
phylotypes established from the donors persisted for up to 3 months after 
transplantation®. In contrast to ecological theory — that disturbances 
eliminating native species facilitate the establishment of exotic species™ 
— transplantation after the resident microbiota have been depleted with 
two broad-spectrum antibiotics reduced rather than promoted the estab- 
lishment of the donor microbiota®. 

The success of microbiota transplantation in the treatment of recur- 
rent CDAD provides further support of the plasticity of an established 
gut microbiota when challenged with a complex one. Bacteriotherapy™ 
involves introducing a faecal sample from a healthy donor as a homogen- 
ate by injection into the caecum using a colonoscope. In one case study, 
the symptoms of a patient with recurrent CDAD were resolved after faecal
transplantation from a healthy donor. Before treatment, the recipient
had a disturbed microbiota with Veillonella and Streptococcus predomi- 
nating, whereas the donor microbiota was dominated by Bacteroides. A 
month after treatment, the patient had no C. difficile in stool samples, 
but the donor species were present, indicating that the donor microbiota 
persisted over this interval. Faecal transplantation for CDAD is becom- 
ing increasingly common and seems to be highly effective. A survey of 
317 patients across 27 case series reported disease resolution in 92% 
of cases, with a single treatment being sufficient in 89% of cases®. The 
success of bacteriotherapy suggests that a previous course of antibiotic 
treatment reduces the resident microbiota to the extent that a donor 
microbiota can establish itself. However, because faecal transplantation 
is not a first-line therapy, there are no data on its effectiveness without
antibiotics (A. Khoruts, personal communication). Understanding the 
successional processes and which microbial taxa allow a healthy micro- 
biota configuration to be established in faecal transplant recipients will 
require further studies. These studies will allow the development ofa 
standardized way of formulating and analysing the donor specimen, 
design and interpretation of dosing studies, and analysis of short- and 
long-term safety. 


Mechanisms for resilience 

Understanding how the microbiota forms a stable, resilient state 
would allow strategies to be devised to increase the resilience of 
healthy states or decrease the resilience of unhealthy states. For 
example, a healthy state with a high resilience to exogenous microbes 
might mean one person at a dinner party escapes food poisoning but 
their companions with lower resilience fall ill. A degraded microbiota 
with high resilience can contribute to chronic diarrhoea or inflam- 
mation, as is inherent with CDAD, IBD and IBS. 


Species and functional response diversity 

The aspects of diversity that are crucial for conferring resilience in 
macroecosystems are probably the same features that are impor- 
tant in microbial ecosystems. One such parameter is the num- 
ber of species present in a given system, or species richness. (In 
culture-independent studies, this number depends on the number
of sequences collected per sample.) Species-rich communities are 
less susceptible to invasion because they use limiting resources more 
efficiently, with different species specialized to each potentially lim- 
iting resource™. An excess of nutrients in water, or eutrophication, 
often decreases ecosystem diversity because a small number of spe- 
cies overgrow and outcompete everything else, with a concomitant 
decrease in resilience”. In a similar way, decreased diversity has been 
linked to obesity and to a diet that is high in fat and sugar compared 
with one that is low in fat and plant-based*’. Whether this decrease in 
microbiota diversity results in a decrease in resilience is not known. 
Low microbiota diversity also correlates with IBD” and recurrent 
CDAD”, but the effects on microbiota resilience are not known. 
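
The dependence of observed species richness on the number of sequences collected per sample can be shown with a small subsampling (rarefaction) exercise. The Python sketch below uses an invented community of 5 abundant and 45 rare species; the species names, read counts and sampling depths are assumptions chosen only for illustration and do not come from any data set discussed here.

```python
import random


def observed_richness(reads, depth, rng):
    """Count distinct species among `depth` reads drawn without replacement."""
    return len(set(rng.sample(reads, depth)))


# Hypothetical community: 5 abundant species (1,000 reads each)
# and 45 rare species (5 reads each), for 50 species in total.
reads = []
for i in range(5):
    reads += [f"abundant_{i}"] * 1000
for i in range(45):
    reads += [f"rare_{i}"] * 5

rng = random.Random(0)  # fixed seed so repeated runs give the same draws
for depth in (50, 500, len(reads)):
    print(f"{depth:>5} reads -> {observed_richness(reads, depth, rng)} species observed")
```

Shallow sampling tends to miss the rare species, so richness estimates are usually compared only after subsampling every sample to a common depth.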

Functional response diversity is the degree to which species in a
community that contribute to the same ecosystem function vary in 
their sensitivity to ecosystem changes”. High functional response 
diversity may, for instance, allow a species that is relatively rare but 
functionally similar to fill a niche when an abundant species is com- 
promised by an environmental disturbance™. In a macroecology 
example, a compromised coral reef that was in a healthy state changed 
to an unhealthy, algal-dominated state only when both the algal- 
grazing fish were overfished and the sea urchins that had increased 
in numbers to fill this niche were compromised by a pathogen®. The 
same principle is also likely to apply to the gut microbiota. Follow- 
ing antibiotic treatment, a previously rare microbe may increase in 
abundance to fill a niche that had been dominated by a microbe with 
higher antibiotic sensitivity, leading to persistence of the same sta- 
ble state but with a decreased resilience because of the decrease in 
functional redundancy. 

Human-gut-adapted bacteria are likely to have high functional 
response diversity because phylogenetically disparate microbes often 
perform similar metabolic functions. For example, methanogenic 
Archaea, sulphate-reducing bacteria and phylogenetically diverse 
acetogens in humans and mice all consume hydrogen generated by 
other microbes during fermentation. Butyrate producers of the 
Clostridiales order can have different ecological strategies, such as 
adaptation to different stages of community succession, oxygen toler- 
ance and substrate preference. The abundance of Anaerostipes caccae 
peaks in infancy, whereas Eubacterium hallii and R. intestinalis are 
more abundant in adults. A. caccae is able to survive 10-60-minute 
periods of exposure to air better than E. hallii or R. intestinalis”. 
Furthermore, E. hallii but not F. prausnitzii can use lactate as a 
substrate’’. Thus, butyrate production in the gut can also continue 
through different successional and metabolic states. 


Competition and feedback loops 

The densely populated gut environment means that microbes com- 
pete to use the same resources or inhibit each other directly using 
antimicrobial products. Phylogenetically related bacteria could be 
expected to compete because of their overlapping functional roles, 
habitats or both, yet contrary to this expectation, phylogenetically 
similar species tend to appear in the same samples”. For exam- 
ple, analysis of bacteria with complete or draft genome sequences 
from 124 individuals from Europe found that the abundance of 
related Enterobacteriaceae species, including E. coli, Salmonella enterica,
Citrobacter koseri and Enterobacter cancerogenus were positively cor- 
related across individuals”. These related species may share environ- 
mental preferences so that they are selected for simultaneously. The 
abundance of closely related species can also predict the susceptibility 
to intestinal colonization by both pathogenic and commensal bacteria 
in mice”. 

Feedback loops can either stabilize or destabilize the microbiota 
(Fig. 6). Stable physiological states are preserved by negative feed- 
back, in which a change to the gut environment results in oppos- 
ing changes that maintain homeostasis. This feedback is likely to be 
controlled by a tight interplay between microbial metabolic activities 
and host pathways. For example, microbial metabolites might induce
changes in the expression of the host pathways that control gut 
retention time so that it rises above or below the optimum, caus- 
ing diarrhoea or constipation. This deviation probably induces host 
signalling pathways to correct it. This is similar to body temperature 
regulation, in which a rise above the optimum induces thermoregu- 
latory mechanisms such as sweating, and a fall induces mechanisms 
such as shivering. Stabilizing the physiological parameters that are 
controlled by negative feedback from the host could therefore pro- 
mote resilience of the microbiota. 

Negative-feedback loops that promote microbiota resilience could 
also operate independently of the host. For example, if the abundance 
of a particular microbe exceeds a certain threshold, this would result in 
a change in the gut environment that would decrease the growth of that 
microbe relative to other species. These feedback loops may involve the 
accumulation of a phage specific to that microbe or the accumulation 
of a specific toxic metabolite. Negative feedback is thought to promote 
high ecosystem diversity, which can promote resilience”*. 
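
A host-independent negative-feedback loop of this kind can be sketched as a pair of coupled rate equations. The Python example below is a hypothetical toy model rather than one drawn from the cited work: a microbe with abundance n produces a self-inhibiting metabolite m, which slows the microbe's own growth as it accumulates. The rate constants r, k, p and d are arbitrary illustrative assumptions.

```python
def simulate(n0, r=0.2, k=0.1, p=0.5, d=1.0, dt=0.01, steps=20_000):
    """Euler-integrate a microbe (n) inhibited by its own metabolite (m).

    dn/dt = n * (r - k * m)   growth slows as the metabolite accumulates
    dm/dt = p * n - d * m     metabolite is produced by n and cleared at rate d
    """
    n, m = n0, 0.0
    for _ in range(steps):
        dn = n * (r - k * m)
        dm = p * n - d * m
        n += dt * dn
        m += dt * dm
    return n, m


# Analytical steady state of the toy model: m* = r / k and n* = d * r / (k * p).
steady_n = 1.0 * 0.2 / (0.1 * 0.5)

# Start far below and far above the steady state; feedback pulls both back.
from_low, _ = simulate(n0=0.1)
from_high, _ = simulate(n0=20.0)

print(f"steady state n* = {steady_n:.2f}")
print(f"from n0 = 0.1  -> n = {from_low:.2f}")
print(f"from n0 = 20.0 -> n = {from_high:.2f}")
```

Under these assumptions the abundance returns to the same value whether it starts well below or well above it, which is the stabilizing behaviour that negative feedback is expected to provide.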

Positive feedback is thought to induce an ecosystem change 
because a difference from a point in one direction produces further 
change in the same direction. However, positive-feedback mecha- 
nisms could support stability at the individual microbe level or of 
the microbial consortia that promote each other's growth. Metabolic 
activities or interactions with host pathways in microbes may induce 
a physiological state that favours their growth over the growth of 
potential competitors, promoting resilience. The microbiota is likely 
to contain both important functional drivers of physiological sta- 
tus and microbes that co-occur with these drivers because they are 
able to thrive in such an environment. Invasion by microbes that do 
not thrive in that particular physiological state would be prevented, 
for example if a microbiota containing functional drivers of a low 
inflammatory state resists colonization by pathogens, and microbiota 
with functional drivers of a relatively high inflammatory state resists
colonization of beneficial microbes (Fig. 6). 

Positive- and negative-feedback loops are also likely to have a role 
in destabilizing the microbiota during regime changes, such as dur- 
ing succession in early development and following a disturbance 
(Fig. 6). Negative feedback, whereby an organism’s activity alters the
environment so that fitness is decreased, can induce a directional 
change when microbes induce a physiological state that favours their 
competitors. For instance, the higher redox potential in the gut of 
infants is probably one of the factors that explains the relative suc- 
cess of facultative anaerobes such as E. coli or some Lactobacillus in 
early development”, but the reduction of oxygen that results from 
their metabolism favours their eventual replacement by a consortium
dominated by strict anaerobes. 

The importance of feedback for successional changes is also likely 
to involve a complex interplay between the microbiota and its host. 
For example, the same change in redox potential that directly affects 
microbial fitness can also affect the expression of host factors in the 
gut epithelium, such as hypoxia-inducible factor, in early develop- 
ment and inflammatory diseases of the gut’’. Manipulating positive 
and negative feedback at the level of the host, the individual microbes 
and the entire gut ecosystem will be essential to maintaining healthy 
stable states and switching from an unhealthy to a healthy state. 


Prospects 

High-throughput sequencing of samples suggests that the micro- 
biota of each person has some resistance to perturbation, but that 
this resistance can be overcome by diet, drugs, prebiotics or pro- 
biotics. Dietary changes may alter the regime in the gut over long 
time periods. The surprising success of whole community trans- 
plants in healthy rats and in humans with CDAD shows that exog- 
enous microbes can colonize the gut even with resistance from an 
entrenched microbiota. However, which microbes will be the best col- 
onizers is unknown; neither do we know how a particular microbial 
Figure 6 | Compositional transitions in the human gut microbiota.
Primary succession of the gut microbiota during early development 
involves a systematic turnover of species until a stable adult state is 
reached. Positive- and negative-feedback loops probably have a role in 
driving primary succession and in conferring resilience to healthy stable 
equilibrium states. Acute disturbances, such as antibiotic administration, 
are often followed by an unstable state that progresses to a stable state 
through secondary succession. In some cases, a complete recovery occurs, 
in which the stable state highly resembles the pre-disturbance state, but 
sometimes the post-recovery stable state is distinct. Post-disturbance stable 
states may be both degraded and resilient; for example, IBS that forms after 
an initial acute disturbance of the microbiota from an enteropathogen
can persist in some individuals for years and even decades”. Resilience


configuration and its functional attributes change in response to 
dietary components or exogenous microbes. We need to learn what 
conditions promote the health of desired species and exclude the 
undesirable ones, just as a gardener would exclude weeds. 

The landscape of stable states for the microbiota and its implica- 
tions for resilience is an important research direction. Small perturba- 
tions, such as short-term dietary changes, may allow the microbiota to 
return to its original state, but larger disturbances, such as antibiotic 
administration, could cause movement into a different state. The long- 
term implications for health are not yet well understood. Furthermore, 
changes to the stable equilibrium of the gut microbiota through long- 
term changes, such as inflammation, diet or repeated antibiotic admin- 
istration, could make new states reachable with smaller perturbations. 
Host genetics, the factors that affect the microbiota early in develop- 
ment, diet and long-term drug administration may all contribute to 
individual differences. Consequently, the general landscape and the cur- 
rent community state could be important for determining individual 
responses to an intervention. 

How resilient the microbiota is to diet, disturbance by antibiotics 
and challenge by exogenous microbes will have important implica- 
tions for health care. The degree to which repeated applications of 
of degraded states is probably driven by unique positive- and negative-feedback
loops that occur both in concert with and independently of the
host. Degradation to a stable state may also occur as a result of persistent 
stressors, such as poor diet, that slowly degrade resilience of a healthy state 
until a threshold is passed so that new feedback mechanisms are needed to 
maintain community composition and stability. Developing therapies that 
encourage transition from degraded to healthy stable states, or complete 
recovery to a healthy stable state following disturbance, may involve 
identifying species or combinations of species, and drivers of feedback 
loops. Whether interventions are more effective early in succession,
when communities are more unstable but possibly stochastic, or later in
succession, when convergence to the end point is more certain but the 
trajectory may be more difficult to change, will be important to investigate. 


broad-spectrum antibiotics degrade the microbiota and its ability 
to function as a healthy system would need to be studied, especially 
in children, in whom the development of interactions between the 
microbiota and host is crucial’. However, identifying suitable controls 
is challenging given the large variation in and between individuals 
during early development. Regime change is not always instigated by 
acute disturbances and can occur gradually, as indicated by studies of 
resilience to dietary changes. Individuals whose microbiota has been 
degraded by long-term consumption of a high-fat and high-sugar 
diet may need long-term dietary changes to restore their microbiota 
to a healthy state. 

The lower taxonomic diversity in individuals from Western cul- 
tures who have a high-fat and high-sugar diet raises the concern 
that global trends in diet could result in important microbial sym- 
bionts being lost from the broader population, possibly leading to 
the extinction of bacterial species that can provide important health 
benefits. Maintaining a collection of cultures from individuals in the 
developing world, and specifically from agrarian cultures, could help 
to preserve potentially important components of the microbiota. 

Microbes often produce unique states in the gut through collective 
activities and cooperative metabolism; understanding associations 
between disease states and sets of species rather than single taxa will
therefore be important”’. A central problem of culture-independent 
metagenomic analyses is that identified phylotypes, or collections 
of these, often represent species for which there is little biological 
knowledge. Gut microbial ecology has come full circle, with the 
development of methods for culturing most of the diversity present 
in a community being given increasing attention so that the con- 
tribution that community members make can be further explored. 
The human faecal microbiota is largely composed of readily cultured 
bacteria”. In vitro experiments with these isolates will be extremely 
valuable for exploring the metabolic attributes of a particular microbe 
or set of microbes, and the genes involved. However, because micro- 
biota composition and function are also controlled by feedback from 
the host, in vivo studies in gnotobiotic mice will be particularly valu- 
able. Indeed, the component of the microbiota that can be cultured 
has similar colonization dynamics, biogeographical distribution and 
response to dietary perturbations as the full microbiota, when trans- 
planted into gnotobiotic mice”. A translational science pipeline is 
developing in which particular phylotypes and their interactions that 
are important for health or that contribute to disease can be identi- 
fied on the basis of distribution patterns with disease. Furthermore, 
the biological attribute that may be driving these patterns can be 
predicted from the expression or prevalence of functional genes in 
whole communities and in genomes. Cultures and animal models can 
be used to test and verify these hypotheses before human application. 
The tools are now in place and although the path will be difficult, the 
direction is clearer.
Reciprocal interactions of the intestinal
microbiota and immune system 


Craig L. Maynard, Charles O. Elson, Robin D. Hatton & Casey T. Weaver


The emergence of the adaptive immune system in vertebrates set the stage for evolution of an advanced symbiotic 
relationship with the intestinal microbiota. The defining features of specificity and memory that characterize adaptive
immunity have afforded vertebrates the mechanisms for efficiently tailoring immune responses to diverse types of
microbes, whether to promote mutualism or host defence. These same attributes can put the host at risk of immune- 
mediated diseases that are increasingly linked to the intestinal microbiota. Understanding how the adaptive immune 
system copes with the remarkable number and diversity of microbes that colonize the digestive tract, and how the system 
integrates with more primitive innate immune mechanisms to maintain immune homeostasis, holds considerable promise 
for new approaches to modulate immune networks to treat and prevent disease. 


Every one of us enters the world devoid of microbial colonization 
because of the sterile environment of the womb. This germ-free 
existence is short-lived: birth exposes the newborn to 
the microbiota of the mother, setting in motion the colonization of 
mucosal tissues in the digestive, respiratory and urogenital tracts, and 
the skin by a diverse microbiota, with which we coexist throughout 
our lives. The complex and dynamic interaction between the micro- 
biota and its human host is the culmination of nearly half a billion 
years of co-evolution with vertebrates that has reciprocally shaped 
the repertoires of the microbiota and the immune system, such that 
the microbiota in humans is normally restrained and well-tolerated. 
The scope of this interaction is particularly evident in the intestinal 
tract, in which the greatest diversity and abundance of microbes reside. 
Estimated at approximately 100 trillion organisms, most of which are 
bacteria (although archaea and eukaryotes are also represented), the 
microbiota numbers about ten times the total cells in the human body, 
with the greatest density populating the distal ileum and colon. The 
collective genome, or metagenome, of the intestinal microbiota has 
more than 100 times the number of genes of the human genome. Each 
individual is populated by roughly 15% of the 1,000 or more species of 
intestinal bacteria that have been described, which reflects the substan- 
tial variability in the composition of the microbiota between individuals. 
There are, therefore, about tenfold more genes in each of our micro- 
biomes than in each of us, encoding the greatest source of potential 
antigens for the immune system to cope with, substantially exceeding 
those of self and pathogen-derived antigens. 

The relationship that has been forged between the intestinal microbiota 
and its human host provides mutual benefits. At homeostasis, the micro- 
biota benefits from the warm, nutrient-rich environment of the gut so it 
can establish a relatively stable ecosystem. Humans in turn benefit from 
a highly adaptive metabolic engine that in addition to providing essential 
non-nutrient factors, such as vitamins, also substantially increases our 
ability to harvest nutrients from food. This increased digestive capacity is 
mainly a result of the microbiota’s complementation of the limited diver- 
sity of complex-carbohydrate-metabolizing enzymes that are encoded 
in the human genome. In addition, by establishing robust, interlinked 
metabolic or nutrient networks, and biofilms among its constituents, the 
microbiota limits the resources available to potential pathogens that must 
outcompete well-adapted and entrenched resident microbes for metabolic 
and physical niche space. Resident microbes thereby establish a microbial 
buffer that limits access by those not part of the consortium. 
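
The order-of-magnitude estimates given earlier in this introduction can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes, as a simplification that the text does not state, that genes are spread evenly across bacterial species. Under that assumption, carrying roughly 15% of the described species from a metagenome about 100 times the size of the human genome gives a ratio of about 15, which agrees with "about tenfold" as an order-of-magnitude statement.

```python
def per_individual_gene_ratio(metagenome_to_human_ratio: float,
                              species_fraction: float) -> float:
    """Estimate microbial genes carried by one person, per human gene.

    Simplifying assumption (not from the text): genes are spread evenly
    across species, so carrying a given fraction of the species means
    carrying the same fraction of the collective metagenome.
    """
    return metagenome_to_human_ratio * species_fraction


def implied_human_cell_count(microbe_count: float,
                             microbes_per_human_cell: float) -> float:
    """Back out the human cell count implied by a microbe-to-cell ratio."""
    return microbe_count / microbes_per_human_cell


if __name__ == "__main__":
    # Figures quoted in the text: metagenome ~100x the human gene count,
    # ~15% of described species per individual, ~100 trillion microbes,
    # ~10 microbes per human cell.
    ratio = per_individual_gene_ratio(100, 0.15)
    cells = implied_human_cell_count(100e12, 10)
    print(f"microbial genes per human gene, per individual: ~{ratio:.0f}")
    print(f"implied human cell count: ~{cells:.0e}")
```

The point of the calculation is only that the quoted figures are mutually consistent at the order-of-magnitude level; it is not a substitute for measured gene counts.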

However, the microbiota is not innocuous, and under conditions 
that compromise the ability of the host to limit the microbiota’s entry 
from the intestinal lumen, some species can invade host tissues to cause 
disease. Furthermore, shifts in the composition of the microbiota, 
whether induced by dietary changes, antibiotic treatment or invasive 
pathogens, can disturb the balance of organisms in the microbiota and 
alter the metabolic network of the collective to favour the outgrowth 
of potentially pathogenic constituents. Referred to as dysbiosis, such 
changes in the microbiota can perturb immune regulatory networks 
that normally restrain intestinal inflammation, and may contribute to 
immune-mediated disease directed against antigens of the microbiota. 
Dysbiosis is most often associated with inflammatory bowel disease 
(IBD), including Crohn’s disease and ulcerative colitis, and necrotiz- 
ing enterocolitis in premature infants. But it is also increasingly linked 
to a number of extraintestinal immune-mediated diseases, including 
rheumatoid arthritis, multiple sclerosis, diabetes, atopic dermatitis and 
asthma, as well as obesity and metabolic syndrome, all of which could 
have their pathogenic origins in untoward reactivity of the immune 
system to the microbiota. Whether dysbiosis is a cause or effect of these 
disorders remains to be determined, but understanding the factors that 
lead to alterations in the composition of the microbiota in these condi- 
tions promises to be informative, irrespective of their basis. 

In this Review, we highlight advances in our understanding of the 
immune mechanisms by which the dynamic interplay of the intestinal 
microbiota and its host normally favours a homeostatic, mutualistic 
relationship, as a basis for understanding the causes of breakdowns in 
this relationship that lead to disease. 


Co-evolution of the microbiota and adaptive immunity 

Vertebrate evolution coincided with the emergence of the adaptive 
immune system, the main components of which are T and B lym- 
phocytes. The acquisition of genetic recombinatorial mechanisms for 
generating diverse, anticipatory antigen-recognition receptors on T cells 
and B cells allowed specific responses to a vast diversity of antigens, 
and long-lived immune memory. Although evolutionary pressures that 
have shaped immune strategies are often viewed in the context of host 
defence, it has been proposed that the emergence of adaptive immunity 
could have been driven as a means to foster, rather than limit, microbial 
colonization. In this view, a substantial survival advantage would have 
been afforded to organisms that could harness the extended and more 
flexible metabolic capacity derived from a permanent, diverse intestinal 
microbiota, as long as the attendant infectious risk could be mitigated. 
Adaptive immunity might have allowed this. 

By adding new layers — such as secretory IgA (sIgA) — to existing 
innate barrier defences, early vertebrates with adaptive immune mecha- 
nisms could begin to promote colonization of their alimentary tracts by 
exerting selective pressures that favoured resident microbes that were 
relatively innocuous, but metabolically useful. The survival benefits that 
accrued as a result of derivation of nutrients from a broader range of foods 
could have been a major factor in the evolutionary success of vertebrates. 
The emergence of adaptive immunity provided a means to recognize, and 
remember, both beneficial and detrimental members of the intestinal 
microbiota to foster maintenance of beneficial species at the expense of 
detrimental ones. In doing so, the emerging adaptive immune system 
would have had to acquire mechanisms for tempering innate immune 
responses programmed only for the clearance of microbes. The develop- 
ment of a broad repertoire of immune cells that could suppress, as well as 
promote, innate inflammatory mechanisms contingent on the threat level 
of the microbe they recognized would become invaluable. 

Support for this hypothesis comes from the study of the diversity 
of the microbiota in invertebrates, which seems to be far less complex 
than the microbiota of vertebrates. Although data on comparative 
metagenomics are limited, the highest density of bacteria found in either 
organisms or the environment is in the human colon. The intestinal 
microbiota of vertebrate species as divergent as rodents and zebrafish 
is similar in complexity to that of humans, and their gastrointestinal 
tracts are similarly colonized soon after birth. Further, a homologue of 
sIgA was recently identified in bony fish. Now extending to all jawed 
vertebrate taxa, except cartilaginous fish and reptiles, this unique class 
of immunoglobulin is adapted for interactions with the commensal 
microbiota and mucosal pathogens. It is tempting to speculate that the 
development of multimeric mucosal IgA, and adaptations that effect 
its efficient transport across the mucosal epithelium, and the compart- 
mentalization of the gut-associated lymphoid tissues (GALT) from 
the peripheral immune system were essential adaptations that allowed 
vertebrates to harness a complex intestinal microbiota. Although more 
work is needed to establish a comparative map of the microbiota in 
the digestive tracts of more vertebrate and non-vertebrate species, it is 
anticipated that the merger of comparative studies in metagenomics 
and immunology will yield insight into the adaptive mechanisms that 
have allowed vertebrates to become an evolutionary success. It may yet 
prove that we owe it to the bugs that have become ‘us’. 


The microbiota and the developing immune system 

The mammalian immune system is perhaps the most elaborate exam- 
ple of the complex symbiotic relationship that has resulted from the 
co-evolution of vertebrates and their microbiota. Unlike other verte- 
brates, placental mammals give birth to live young that are carried to 
term in utero and are nursed with milk that is rich in maternal antibod- 
ies. These antibodies provide passive transfer of immunity from mother 
to infant, which has implications for the developing immune system of 
the infant, and the microbiota that colonize the gut. 


Maternal effects on the neonatal microbiome 

The greatest initial contribution to the composite human–microbiota 
‘superorganism’ is the vertical transmission of components of the moth- 
er’s microbiota to the child at birth. Normally, intestinal colonization of 
neonates is dominated by transmission of bacteria from the maternal 
vaginal flora, which is less diverse than that of the lower intestinal tract. 
The ‘pioneer’ species received from the mother seem to be important, 
because infants born by Caesarean section (who are initially colonized 
by bacterial species of epidermal, rather than vaginal, origin) are pre- 
disposed to development of allergies and asthma later in life. This is 
despite the fact that, although Caesarean-section-born infants lag behind 
those born transvaginally in their acquisition of the two bacterial divi- 
sions dominant in the adult microbiota (Firmicutes and Bacteroidetes), 
they do catch up. Conversely, mother-to-child transmission in neonates 
delivered transvaginally rapidly shifts to skin and oral ecologies postpar- 
tum, such that these infants are later exposed to similar species as those 
delivered through Caesarean section. Thus, first contact could be deter- 
ministic. That is, pioneer bacterial species might have substantial and 
lasting effects on the immune response, irrespective of the composition 
of the mature microbiota. 

The neonatal microbiota varies erratically until about 1 year of age, when 
it stabilizes, establishing a consortium that resembles that of adults. 
During this initial period, the neonatal immune system rapidly matures 
under the influence of the microbiota. Although environmental factors 
such as diet, exposure to new microbes, xenobiotics, bacteriophages and 
intestinal infections have important roles in shaping the composition of 
the microbiota during this maturational window, the part the neonatal 
immune system plays is less clear. What is clear is the initial and ongoing 
influence of breastfeeding on the infant's microbiome. In addition to a 
unique mix of nutrients and antimicrobial proteins that influence the 
ecology of the neonatal microbiota, breast milk provides abundant sIgA, 
the specificities of which have been shaped by the maternal microbiota. 
The mucosal immune memory of the mother is thereby transmitted 
to her offspring. Thus, in breastfed infants delivered transvaginally, the 
intestinal microbiota is not only seeded by maternal bacterial species, 
but its composition may also be reinforced and shaped by the maternal 
sIgA repertoire that is influenced, in turn, by the maternal microbiota. 

Maturation of the infant mucosal immune system takes months, so 
the passive transfer of maternal sIgA has a considerable protective role 
against potential pathogens that could perturb the ecological trajectory of 
the infant's intestinal microbiota. Maternal sIgA also shields the neonatal 
immune system from its own microbiota, perhaps so the neonate’s defences 
are not overwhelmed before they are fully developed. In so far as microbial 
antigens that are bound by sIgA are handled by the innate immune system 
in a ‘tolerogenic’ mode, partly owing to IgA’s poor fixation of complement, 
transfer of maternal sIgA to the infant seems to favour the establishment 
of regulatory immune networks in the infant that promote a mutualistic 
relationship with the microbiota. Thus, in addition to the direct effect of 
maternal sIgA on the microbiota of the infant, it also has immunomodu- 
latory effects on the developing infant’s immune repertoire so as to indi- 
rectly influence its microbiota. The maternal-neonate metagenomic bond 
is, therefore, extended postnatally, providing one mechanism by which 
microbial ecologies tend to cluster in family members. 


Microbiome effects on developing host-barrier defence 
Maturation of the intestinal mucosa and its GALT — Peyer’s patches of 
the distal ileum, isolated lymphoid follicles (ILFs) and mesenteric lymph 
nodes (MLNs) — is initiated by, and contingent on, intestinal coloniza- 
tion. Peyer’s patches and MLNs develop prenatally, but ILFs develop 
postnatally. However, each of these lymphoid tissues requires signals 
derived from the sensing of intestinal microbiota for their complete 
development, recruitment of a mature complement of immune cells, 
or both. Similarly, non-lymphoid structures of the intestinal mucosa 
that contribute to the establishment of host-microbiota mutualism are 
driven by colonization of the neonate. 

Preparation for the adaptive immune response to neonatal coloniza- 
tion requires the prenatal actions of a subset of innate lymphoid cells 
(ILCs), termed lymphoid tissue inducer (LTi) cells (Fig. 1). LTi cells are 
instrumental in prenatal organization of the development of lymphoid 
tissues, including the components of the GALT. LTi cells develop in the 
fetal liver from a common lymphoid precursor that gives rise to all lym- 
phoid cells. During fetal development, LTi cells disseminate to the MLN 
and Peyer's patches anlagen, stimulating the development of these struc- 
tures, and the recruitment and partitioning of B and T cells into B-cell 
follicles and T-cell zones that characterize secondary lymphoid tissues. 

Figure 1 | The gut-associated lymphoid tissue establishes perinatal host- 
microbiota mutualism in the intestine. a, Prenatally, secondary lymphoid 
tissues (Peyer's patches and mesenteric lymph nodes) and cryptopatches 
develop by the spatiotemporal recruitment of lymphoid tissue inducer 
(LTi) cells to sites of the developing intestine and supporting neurovascular 
structures. This, in turn, stimulates the recruitment of dendritic cells, T 
cells and B cells in preparation for the immune response to the microbiota. 
Intraepithelial lymphocytes seed the epithelium before birth. b, Postnatally, 
bacteria colonize the neonatal intestine immediately, initiating multiple events 
that affect the development or functional maturation of the mucosa and 
gut-associated lymphoid tissues. Shown from left to right: microbe-associated 
molecular patterns (MAMPs) sensed by pattern-recognition receptors on 
intestinal epithelial cells and dendritic cells adjacent to cryptopatches stimulate 
the further recruitment of B cells and T cells, causing the cryptopatches 
to develop into mature isolated lymphoid follicles. The isolated lymphoid 
follicles release IgA-producing plasma cells (which are formed through 
T-cell-dependent and -independent interactions) into the lamina propria. 
Microbes also cross the epithelium and enter Peyer's patches through M 
cells, from which they are endocytosed by dendritic cells in the subepithelial 
dome. Antigen-loaded dendritic cells in the Peyer’s patch interact with 
local lymphocytes to induce T-cell differentiation and T-cell-dependent 
B-cell maturation in the germinal centre to induce the development of IgA- 
producing plasma cells that home to the lamina propria, where they release 
dimeric IgA for transport into the intestinal lumen. Dendritic-cell-mediated 
luminal sampling of microbial products or transcytosis of bacteria across the 
epithelium results in antigen loading of lamina propria dendritic cells, which 
then migrate through the afferent lymphatic vessels (not shown) to a draining 
mesenteric lymph node to induce differentiation of effector T cells that traffic 
to the lamina propria. Shown on the far right, sensing of MAMPs stimulates the 
proliferation of intestinal epithelial cells in crypts, resulting in their increased 
depth and, in the small intestine, increased density of Paneth cells. This sensing 
also arms the intestinal epithelial cells for release of antimicrobial peptides. 

The development of ILFs is also dependent on LTi cells, but is initiated 
after birth when these cells cluster in the lamina propria below the intes- 
tinal crypts, forming cryptopatches. ILFs develop from cryptopatches 
only after colonization by the intestinal microbiota. Central to sens- 
ing the colonizers of the intestinal tract is the expression of a diverse 
range of germline-encoded pattern-recognition receptors (PRRs) by 
intestinal epithelial cells (IECs) and immune cells resident in the gut. 
These receptors include transmembrane Toll-like receptors (TLRs) 
and C-type lectin receptors (CLRs) that reside on the cell surface or in 
endosomes, and cytosolic nucleotide-binding oligomerization domain 
(NOD)-like receptors (NLRs), which cells of the intestinal mucosa use 
to detect microbe-associated molecular patterns (MAMPs) that are 
expressed by constituents of the resident microbiota, as well as patho- 
gens. MAMPs are recognized (through mechanisms that are not well 
understood) by cells in the neonatal gut, stimulating ILF development 
to generate a lymphoid structure that is capable of supporting the matu- 
ration of B cells that produce sIgA. At least one of the microbial products 
that induces this process is peptidoglycan derived from Gram-negative 
bacteria. Recognition of peptidoglycan by NOD1 in IECs elicits produc- 
tion of CCL20 and β-defensin 3 that direct the recruitment of B cells to 
LTi-dendritic-cell clusters in cryptopatches to induce the expression 
of sIgA (ref. 12). 

Unlike MLNs and Peyer's patches, for which class-switch recombi- 
nation to produce IgA is T-cell dependent, IgA class-switching in ILFs 
can occur through T-cell-dependent and -independent pathways. In 
addition, MLNs and Peyer's patches are unique to mammals, whereas 
ILFs are found in non-mammalian vertebrates, suggesting that they 
are the evolutionary forerunners of the secondary lymphoid tissues of 
the GALT and may have evolved to accommodate, and promote, an 
increasingly complex intestinal microbiota before specialized secondary 
lymphoid tissues emerged. The importance of ILFs in regulating the 
microbiota is evident in mice that are devoid of these structures, and in 
which Gram-negative bacteria are over-represented. Similar to Peyer's 
patches, ILFs develop in the intestinal mucosa in intimate association 
with the overlying intestinal epithelium (Fig. 1). Both structures are 
strategically deployed to sample the microbiota through specialized 
microfold, or M, cells, which transport intact microbes or their products 
across the epithelial barrier. In this way, the innate and adaptive immune 
systems constantly monitor the microbiota and are ‘informed’ of the 
dominant bacterial species that reside in proximity to the epithelium. 

Like the GALT, the epithelium of the fetal and newborn intestine 
is incompletely adapted to dense bacterial colonization, but responds 
rapidly to perinatal colonization to accommodate the microbiota. 
Expression of — and signalling by — TLR4, which is the receptor for the 
abundant product of Gram-negative bacteria, lipopolysaccharide, are 
significantly increased antecedent to intestinal colonization at birth. 
Within hours of exposure to the microbiota, the response of IECs to 
lipopolysaccharide is markedly attenuated through the down-regulation 
of TLR4 and components of its signalling apparatus. This increase in 
TLR4 expression and signalling followed by down-regulation seems 
to represent a mechanism whereby the fetal epithelium is primed to 
transmit transient inflammatory signals on sensing the microbiota to 
promote an early immune response, but then becomes rapidly desensi- 
tized as an adaptation to the mounting bacterial load. 

An additional immune mechanism that prepares the neonate for 
commensal colonization is the seeding of the intestinal epithelium 
with specialized intraepithelial lymphocytes before birth. Intraepithelial 
lymphocytes intercalate between IECs where they exist in an activated 
state and can respond rapidly to microbial encroachment. Throughout 
life, intestinal intraepithelial lymphocytes help to maintain the integrity 
of the epithelial-cell barrier, limit bacterial translocation and facilitate 
epithelium repair after injury by secreting soluble mediators, such as 
antimicrobial peptides. 

Although the host has evolved elaborate developmental strategies 
to prepare for postnatal colonization, the microbiota reciprocates by 
inducing maturation of the host’s immune system. Therefore, compared 
with conventionally raised mice, germ-free or ‘germ-reduced’ mice have 
reduced size and cellularity of secondary lymphoid tissues as well as 
altered numbers, frequencies or diversity of immune-cell populations 
at mucosal sites, and thus mount abnormal responses to infection and 
injury. In addition, a postnatal time window is thought to exist in which 
regulatory T (Treg) cells and invariant natural killer T (iNKT) cells 
that have received their ‘primary education’ in the thymus, receive 
a ‘secondary education’ from the microbiota to establish a durable 
immune repertoire that is crucial to the prevention of inflammatory 
diseases in childhood and adulthood. 


Innate pathways to sense and restrain the microbiota 

The intestinal tract is the largest barrier tissue in the human body (it has 
a surface area of about 300 m² in adults), making it the most extensive 
portal for entry of commensal or pathogenic microbes. Highly special- 
ized barrier defences have evolved to confine the microbiota and resist 
pathogens, while maintaining its main function of nutrient uptake. The 
strategy is one of a layered defence that integrates a stratified mucous layer, 
a relatively impenetrable but highly responsive epithelium, and a lamina 
propria populated by innate and adaptive immune cells that actively 
participate in homeostatic responses to restrain the microbiota without 
undue inflammation, yet are poised for the induction of antimicrobial 
clearance responses and tissue repair, should the barrier be breached. 


Reciprocity between the epithelium and the microbiota 

The epithelium is central to the orchestration of intestinal defences. 
Not simply a passive barrier to microbial translocation, the intestinal 
epithelium is an active sensor of, and conduit for, the dialogue between 
the host and microbiota. The five main cell types that comprise the epi- 
thelium — absorptive enterocytes, goblet cells, Paneth cells, M cells and 
enteroendocrine cells — develop from a common stem cell located near 
the base of the intestinal crypts. Each cell type has a specialized, integral 
role in intestinal homeostasis, and is both responsive to the microbiota 
and conditioned by it (Fig. 1). 

Maintaining the polarized structure of the epithelial barrier is crucial 
to its function. The tight junctions that seal the interfaces of adjacent 
IECs segregate the epithelium into an apical, lumen-exposed surface 
and a basolateral surface anchored to the basement membrane. Depend- 
ing on whether the membrane-associated PRRs are arrayed on the apical 
or basolateral surface of the IECs and on which side of the epithelial 
barrier MAMPs are detected, cells of the epithelium initiate responses 
that either promote the release of protective factors that are directed 
luminally (for example, secreted mucins and antimicrobial peptides) 
to directly restrain the microbiota, or internally (for example, cytokines 


and chemokines) to either promote immune quiescence at homeostasis 
or activate inflammatory immune responses when the epithelial barrier 
has been breached. 

As a mucosal tissue, the intestinal epithelium continuously produces 
and is invested by a layer of mucus that is a first line of defence against 
microbes (Fig. 2). Mucus is produced by goblet cells and is composed 
of heavily glycosylated mucin proteins, as well as other protective mol- 
ecules, such as trefoil factor, that contribute to epithelial restitution 
and repair. Production of intestinal mucus is regulated by products 
of the microbiota. In germ-free mice the mucous layer in the colon is 
highly attenuated, despite normal numbers of mucin-laden goblet cells. 
Addition of the MAMP lipopolysaccharide or peptidoglycan stimulates 
the release of mucin by goblet cells and the rapid reconstitution of the 
colonic inner mucous layer. Butyrate produced by benign constituents 
of the microbiota also promotes increased release of mucin, providing 
a positive-feedback loop for maintenance of the mucous barrier and 
its colonization by butyrate-producing commensals. The importance 
of the mucus layer is evident in mice deficient in the principal intestinal 
mucin, Muc2. These mice have increased translocation of commensal 
and pathogenic bacteria, and spontaneously develop colitis. Com- 
ponents of the healthy microbiota therefore directly contribute to the 
barrier function of the intestine through their induction of mucin pro- 
duction and secretion by goblet cells. 

The thickness and continuity of intestinal mucus differ regionally: 
it is thinner and discontinuous in the proximal small intestine and 
becomes thicker and continuous in the distal small and large intes- 
tine, showing some correlation with the local bacterial load (10°-10° 
organisms per gram of luminal contents in the duodenum and jejunum; 
about 10° organisms per gram in the ileum, and 10'°-10" organisms per 
gram in the colon). The mucus is stratified into two functionally distinct 
layers: a compact, firmly adherent inner layer that is sparsely populated 
by bacteria and a more loosely structured, non-adherent outer layer that 
is at least tenfold more densely populated by the microbiota. 

Although both mucous layers have a similar Muc2-dominated com- 
position, proteolytic cleavage of Muc2’s polypeptide backbone in the 
outer layer results in its expanded volume and accessibility to coloniza- 
tion by components of the microbiota. Indeed, the outer mucous layer 
provides an anchor for the attachment of bacteria of the microbiota that 
can establish biofilms that exclude pathogens. Furthermore, in addi- 
tion to dietary glycans, mucin glycans are nutrients for some constitu- 
ents of the microbiota, such as Bifidobacterium and Bacteroides spp., 
thereby promoting their retention in the collective (Box 1). These 
bacteria ferment complex O-linked mucin glycans to produce short- 
chain fatty acids (SCFAs) derived from mucin catabolism (for example, 
acetate and lactate) that are toxic to some pathogens, and produce 
other metabolites (such as propionate and butyrate) that are the 
main nutrient source for colonic IECs. The SCFAs also signal through 
G-protein-coupled receptors on IECs, to downregulate host inflammatory 
responses. Thus, colonization of the outer mucous layer by commensals 
is an important adaptation that supports a stable host-microbiota 
relationship. 


BOX 1 

Modulation of the microbiota by blood-group antigens 

An important aspect of the microbiota’s use of the intestinal mucous 
layer as an ecological niche is its decoration by blood-group antigens. 
Analogous to blood-group antigens on erythrocytes, the assembly 
of type A, B or Lewis-b glycans on intestinal mucins is contingent 
on the generation of the core H-glycan by the actions of FUT2. This 
protein is encoded by FUT2, which is functional in most individuals 
(secretor genotype), but is non-functional in a significant minority 
(about 20% of Caucasians, referred to as non-secretors) owing to a 
missense mutation. The presence or absence of a functional FUT2 
allele correlates strongly with the composition of the microbiota. In 
particular, Bifidobacterium spp., which are a beneficial component of 
the microbiota, are dependent on the terminal blood-group glycans 
for colonization of the intestinal mucus and are less abundant in 
non-secretor genotypes. The non-secretor phenotype is associated 
with necrotizing enterocolitis and Gram-negative sepsis in premature 
infants, as well as with Crohn’s disease. Therefore, it seems 
that the protective benefits of colonization by Bifidobacterium spp. are 
a mutually beneficial evolutionary adaptation of this commensal to its 
host. Given the predisposition to dysbiosis and its deleterious health 
effects in non-secretors, it is likely that evolutionary pressure to retain 
the defective FUT2 allele in the population is a result of mucin-linked 
blood-group glycans also serving as sites of attachment for mucosal 
viruses (such as noroviruses) so that those with a non-secretor 
genotype are protected. 


In contrast with the outer mucous layer, the 
inner mucous layer provides a relatively imper- 
meable barrier against the microbiota. This 
characteristic is a result of the layer’s compact 
physicochemical structure and its function as 
a reservoir for microbicidal products of the 
epithelium, including antimicrobial peptides 
that are specialized to kill different classes of 
microbes and sIgA, which is retained in the 
mucous layer after being shuttled across the 
epithelium by polymeric immunoglobulin 
receptor (pIgR) (Fig. 2). Effectively, the inner 
mucous layer is a ‘killing field’ that few patho- 
gens or commensals have evolved strategies 
to penetrate. This microbe-sparse zone is 
enforced, in part, by the antibacterial lectin 
Reg-IIIγ (ref. 25). Because Reg-IIIγ is selec- 
tively bactericidal for Gram-positive bacteria, 
it is likely that antimicrobial peptides specific 
for Gram-negative bacteria also contribute to 
the microbe-sparse zone. A similar spatial seg- 
regation of bacteria from the epithelium has 
been identified in the colon, in which a thicker 
inner mucous layer excludes most bacteria 
even in the face of a substantially higher bacte- 
rial load. 

The region of the intestinal tract with the 
poorest coverage of protective mucus, the 
proximal small intestine, also has the greatest 
exposed epithelial surface area owing to its prominent villous structure. 
Although this feature favours digestion and absorption of nutrients it 
also seems to make the epithelium in this region particularly vulner- 
able to entry by the microbiota and pathogens. However, the bacterial 
loads in this region are the lowest along the length of the intestine, with 
over a million-fold fewer bacteria per unit of luminal contents than the 
large intestine. This is, in part, owing to the more vigorous peristaltic 
motility of the proximal small intestine, which rapidly clears mate- 
rial, including microbes, from the lumen. The lumen of the proximal 
small intestine is also the entry point for the contents of the gall bladder 
and stomach, which contain high levels of bile salts and acid that have 
antimicrobial effects. 

In addition, the bases of the crypts of the small intestine are home to 
many Paneth cells. These IECs are arrayed with a range of PRRs and 
are specialized for the production and release of abundant antimicrobial
peptides, including α-defensins, which are small, highly cationic
microbicides unique to Paneth cells. In contrast with other antimicrobial
peptides, such as RegIIIγ, synthesis of which requires signals from
the microbiota, α-defensins are synthesized and stored in Paneth-cell
granules without the need for sensing of MAMPs. However, the release
of Paneth-cell granules is induced by MAMPs, and the secretion of
active α-defensins has been shown to control the composition of the
microbiota. NOD2, which was the first susceptibility gene linked to
Crohn’s disease, encodes an NLR that is important in sensing the
microbiota to control the release of antimicrobial peptides by Paneth 
cells. Deficiencies of antimicrobial peptide production, such as occur 
in NOD2 mutants, are thought to result in altered microbiota compo- 
sition and density in the small intestine that heighten susceptibility to 
intestinal inflammation, particularly in the terminal ileum where the
highest density of Paneth cells is found.

Colonic IECs also regulate the composition of microbiota 
through PRR-dependent mechanisms. Mice deficient in the NLRP6 
Figure 2 | The barrier function of the intestinal epithelium. Distinct subpopulations of intestinal
epithelial cells (IECs) are integrated into a continuous, single cell layer that is divided into apical and 
basolateral regions by tight junctions. Enterocytes, in the small intestine, and colonocytes in the large 
intestine, as well as specialized Paneth cells in the bases of small intestinal crypts continually sense the 
microbiota to induce the production of antimicrobial peptides (AMPs). Goblet cells produce mucin,
which is organized into a dense, more highly cross-linked inner proteoglycan gel that forms an IEC-adherent
inner mucous layer, and a less densely cross-linked outer mucous layer. The outer layer is highly
colonized by constituents of the microbiota. The inner mucous layer is largely impervious to bacterial 
colonization or penetration due to its high concentration of bactericidal AMPs, as well as commensal- 
specific secretory IgA (sIgA), which is ferried across IECs from their basolateral surface, where it is bound 
by the polymeric immunoglobulin receptor (pIgR), to the inner mucous layer where it is released by 
proteolytic cleavage of pIgR. Responding to the microbiota, innate lymphoid cells, including RORγt- and
AhR-expressing LTi and NK-22 cells, produce IL-22, which stimulates AMP production and promotes 
epithelial barrier integrity. 


inflammasome in colonic IECs have an altered microbiota that con- 
fers increased susceptibility to colitis because of damage to the colonic 
epithelium. The increased susceptibility to colitis that results from
NLRP6 deficiency is transmissible to wild-type mice, indicating that a 
dysbiotic flora is a contributory factor. How deficiency of the NLRP6 
inflammasome results in an altered microbiota is incompletely defined, 
although reduced interleukin (IL)-18 levels in NLRP6-deficient mice
suggest this cytokine has an important role.


Reciprocity between the epithelium and innate immune cells 
Although the intestinal mucous layer largely insulates the intestinal 
epithelium from direct interactions with the microbiota, bacterial 
metabolites and components are able to permeate this zone and alter 
gene expression in IECs through PRRs. In addition to the epithelial
products that are secreted apically to restrict contact with the microbiota 
(for example, mucins and antimicrobial peptides), the epithelium also 
produces factors, such as chemokines and cytokines, that are secreted 
basolaterally. These factors signal immune cells residing internal to the
epithelium, particularly in the intestinal lamina propria (Fig. 3). Acti- 
vation of PRRs typically promotes pro-inflammatory innate responses, 
so the intestinal epithelium had to evolve strategies to mitigate these 
responses for commensals such that, at homeostasis, cytokine signals 
transmitted to mucosal immune cells limit inflammation. Several mecha- 
nisms by which this anti-inflammatory state is favoured have emerged. 
The strategic distribution of TLRs and NLRs on and within IECs 
has a considerable effect on whether bacterial MAMPs will be recog- 
nized, and if so, whether their recognition promotes pro-inflammatory 
responses or represses them. Pathogenic bacteria possess virulence 
factors to allow them to attach to or invade IECs thereby introduc- 
ing MAMPs into the IEC cytosol where they are recognized by NLRs; 
however, bacterial strains of the indigenous microbiota are non-invasive 
and therefore less potent activators of NLRs. In the intact epithelium, 
MAMPs derived from commensals are largely restricted to interactions
with apically accessible TLRs, which seem to be functionally dampened 
by tonic exposure to the microbiota, whether through decreased expres- 
sion, impaired signalling or both. 

By contrast, expression of TLRs on the basolateral aspects of IECs 
can signal a barrier breach by both commensals and pathogens. This is 
exemplified by TLR5, the receptor for flagellin, which is expressed on
the basolateral aspects of the epithelium where it can detect the repeated
flagellin monomers that make up bacterial flagella. Detection of
flagellin on the ‘host’ side of the intestinal epithelial barrier results in 
epithelial-cell activation, which initiates inflammatory responses by 
innate and adaptive immune cells with the aim of eradicating the invad- 
ing bacterium. The importance of flagellin as an immune-activating 
MAMP and an antigen targeted by adaptive immune cells is reflected 
in its high representation among antigens bound by serum antibodies 
from colitic mice and in patients with Crohn’s disease.

Apical expression of TLRs and restricted access to cytosolic NLRs are 
adaptations to constrain interactions with luminal microbes, but some 
commensals have also evolved strategies that actively dampen TLR 
signalling in IECs, should contact occur. Bacteroides and Lactobacillus 
spp. inhibit activation of the classical NF-κB pathway, which is central
to induction of pro-inflammatory gene expression downstream of all
PRRs. Active NF-κB dimers are sequestered in the cytosol by association
with IκB complexes. Release of NF-κB for nuclear localization is
contingent on the phosphorylation of IκB, by which it is targeted for
ubiquitylation and proteasomal degradation downstream of PRR signalling.
Contact of IECs with these commensals inhibits IκB degradation,
thereby blocking nuclear transport of NF-κB (ref. 34). Bacteroides spp.
can further dampen NF-κB signalling by enhancing the nuclear export
of NF-κB by inducing increased expression of the nuclear receptor family
member, peroxisome proliferator-activated receptor-γ (PPAR-γ).
PPAR-γ binds NF-κB in the nucleus and shuttles it back to the cytoplasm,
thereby terminating its transactivation of pro-inflammatory
Figure 3 | The epithelial-innate-adaptive continuum in response to microbial antigens. a, In response
to the microbiota, IECs secrete mucins and AMPs that limit microbial interaction with epithelial cells.
Under homeostatic, eubiotic conditions, MAMPs stimulate the secretion of cytokines (including TSLP,
IL-33, IL-25 and TGFβ) by IECs that promote development of tolerogenic macrophages and dendritic
cells. Dendritic cells, in turn, induce the development of induced Treg (iTreg) cells through a TGFβ-
and retinoic acid (RA)-dependent process. Through multiple mechanisms, including the secretion of
TGFβ and IL-10 by iTreg cells and the secretion of IL-10 by macrophages, the anti-inflammatory balance
of the intestine is maintained by inhibiting or dampening potential effector responses. In addition,
Treg cell-derived TGFβ and epithelial-derived BAFF and APRIL promote development of IgA+ plasma cells
to ensure an abundant supply of sIgA in the lumen that further limits microbial interaction with the
epithelium. b, In the face of pathogen invasion, mucosal injury or dysbiosis, MAMPs stimulate the
secretion of pro-inflammatory cytokines by IECs (including IL-6, IL-1 and IL-18) and intestinal
dendritic cells and macrophages (including IL-6, IL-23 and IL-12) that induce development of the
effector CD4+ T cells TH1 and TH17, the latter of which can transition to the former as a result of
IL-23 or IL-12 signalling. Intestinal innate lymphoid cells, including NK-like cells, LTi cells, and
γδ IELs, respond to pro-inflammatory cytokines to upregulate IL-22, which helps to protect the
epithelial barrier, and IL-17A and IL-17F, which are involved in neutrophil recruitment.
genes. Thus, in addition to host mechanisms that normally restrain
direct contact between commensals and IECs, co-adaptation of the
microbiota and host has provided mechanisms that suppress
pro-inflammatory engagement of PRRs in the intact epithelium.

The epithelium also supports tolerance of the microbiota by conditioning
intestinal dendritic cells through directional release of immunomodulatory
cytokines into the lamina propria. At homeostasis, IECs
secrete thymic stromal lymphopoietin (TSLP), IL-33 and IL-25, which
promote tolerogenic activities of a subset of intestinal dendritic cells
defined by expression of the integrin αEβ7 complex (also known as CD103).
Antigens presented by CD103+ dendritic cells favour the development
of Treg cells and IgA+ plasma cells that ‘home’ to the lamina propria,
where they repress inflammatory responses to the microbiota. IECs
also produce abundant TGFβ, which suppresses NF-κB-dependent
pro-inflammatory signalling in intestinal macrophages and dendritic
cells, and promotes development and maintenance of Treg cells and IgA+
plasma cells. TLR-activated IECs can directly promote T-independent
development of sIgA-producing plasma cells in ILFs by the release of
both B-cell activating factor (BAFF) and a proliferation-inducing ligand
(APRIL). In addition to cytokines, IECs express other factors that limit
pro-inflammatory responses of mucosal immune cells, including products
of tryptophan metabolism and prostaglandins (for example, PGE2).

Like IECs, intestinal macrophages normally express lower levels of 
TLRs and are hyporesponsive to TLR signalling to dampen inflam- 
matory responses to the commensal microbiota. This is despite their 
retention of highly active phagocytic and bactericidal activities.
Also like IECs, this inflammatory anergy is probably multifactorial, 
but reflects the limited entry of commensal bacteria or their prod- 
ucts into the cytosol in which they can activate NLRs — despite their 
uptake and degradation in phagolysosomes. Although phagocytosis of 
commensal bacteria induces pro-IL-1 in intestinal macrophages, its 
cleavage by caspase-1 to produce the active form requires NLRP3- or 
NLRC4-containing inflammasomes, which are activated by MAMPs 
that are more effectively delivered into the cytosol by pathogenic, rather
than commensal, bacteria. Therefore, phagocytes in the intestinal
lamina propria are conditioned to mount inflammatory responses 
against pathogens that deliver MAMPs to cytosolic sensors, but to 
efficiently clear commensals in a non-inflammatory manner. 


Innate lymphoid-cell responses to epithelial barrier breach
Although LTi cells are indispensable in the development of the 
GALT, they are only one subset of a growing family of ILCs that have 
emerged as important contributors to intestinal homeostasis and barrier
defence in the postnatal immune system. The diversity of ILC
subsets seems to parallel that of CD4+ T cells, with which they share
many functional features. However, the antigen-specific-receptor- 
lacking ILCs are recruited to mucosal immune responses mainly by 
cytokines, which are produced by IECs or intestinal myeloid cells 
(dendritic cells and macrophages). 

ILCs are distributed in the intestinal lamina propria and GALT, in 
which they are poised to respond rapidly to microbes that penetrate the 
epithelial defences. LTi cells are particularly important to barrier defence 
against bacterial incursions, as is a subset of natural-killer-like cells
(referred to as NK-22 or ILC22 cells), which seem to share many features
with LTi cells. Common to these cells, and to CD4+ T cells of the TH17
lineage (discussed later), is their expression of the transcription factor
retinoic acid-related orphan receptor (ROR)γt, which is required for
these cells’ development and function, and production of the cytokines
IL-22, IL-17A and IL-17F (ref. 45). IL-22 is a member of the IL-10 
cytokine family that acts on epithelial cells of barrier tissues, including 
the intestine, to enhance antimicrobial defence and epithelial barrier 
integrity. IL-17A and IL-17F target innate immune cells, stromal cells 
and epithelial cells to induce cytokines and chemokines (for example, 
G-CSF and CXCL8) that induce increased neutrophil production and 
recruitment, respectively. 

RORγt+ ILCs seem to be the main source of IL-22 at steady state
and, in a continuous feedback loop with the microbiota, release IL-22,
IL-17 or both in response to microbiota-induced production of IL-23
by macrophages and dendritic cells. These cytokines stimulate epithelial-cell
secretion of antimicrobial peptides that inhibit or kill bacteria in
the vicinity of the epithelial cell surface. Although ILCs can contribute
to intestinal pathology under certain circumstances, they are crucial
in the early host response to enteropathogenic bacteria.

RORγt+ ILCs also express the aryl hydrocarbon receptor (AhR) with
which they sense microbiota metabolites and xenobiotics. AhR signalling
is required both for ILC maintenance and IL-22 production.
Akin to its essential role in GALT development, ILC production of lymphotoxin
provides an important amplifying loop for the production of
IL-22 by ILCs, stimulating dendritic cell production of IL-23 that, in
turn, increases IL-22 production by ILCs. In addition to RORγt and
AhR, another transcription factor shared between innate and adaptive
immune cells that influences the composition of the microbiota is T-bet.
This transcription factor is involved in TH1 cell development, and is also
expressed by a distinct subset of ILCs. Under pathogen-free conditions,
RAG-deficient mice, which lack B and T cells, are relatively resistant
to inflammation driven by their microbiota, despite the absence of
an adaptive immune system. This is largely owing to compensatory
increases in ILCs. However, when deficient for T-bet, RAG-deficient
mice develop spontaneous intestinal inflammation resembling ulcerative
colitis (known as T-bet−/− RAG−/− ulcerative colitis (TRUC) mice).
Disease is a result of overexpression of TNFα by dendritic cells and an
absence of Treg cells. Remarkably, transfer of the microbiota from TRUC
mice to immunocompetent mice transfers disease, reflecting the outgrowth
of two pathogenic bacterial strains: Klebsiella pneumoniae and
Proteus mirabilis. Although details of the mechanisms responsible for 
T-bet-dependent restraint of the microbiota remain to be defined, the 
dysbiosis in TRUC mice highlights the importance of innate immune 
cells in regulating the composition of the microbiota and is evidence that 
a dysbiotic flora can drive disease in otherwise healthy hosts. 
Adaptive immunity in homeostatic responses to microbiota
The interplay between the microbiota, intestinal epithelium and innate 
and adaptive immune cells at homeostasis favours the dominance of reg- 
ulatory networks that prevent inflammation or immune-mediated dis- 
ease. Remarkably, the innate defences are sufficiently robust that much 
of the host response to the microbiota progresses without involvement 
of CD4+ T-cell-dependent responses. Furthermore, a portion of the sIgA
response that contributes to partitioning of commensal organisms away
from the epithelium is generated in ILFs without the requirement for T
cells. When the adaptive immune response is recruited, it is typically
limited to the mucosal tissues such that systemic immunity is not gener- 
ated. This compartmentalization of mucosal immunity is a result of the 
programming of adaptive immune cells to travel back to the mucosae 
following their differentiation in Peyer’s patches or MLNs. Finally, at 
steady state, antigen presentation by intestinal dendritic cells favours the 
development of regulatory CD4+ T cells, which suppress the development
of pro-inflammatory innate and effector T-cell responses to avert
excessive inflammation. 


Regulatory T-cell responses to the microbiota 

CD4+ Treg cells are essential for maintenance of mutualism with the
microbiota. The frequencies of these T cells are considerably elevated in
the intestine relative to other tissues, and microbiota-induced Treg cells
affect the composition of the microbiota. The crucial role of Treg cells
in immune homeostasis to the microbiota is well documented by the
consequences of these cells’ absence. Treg-cell deficiency, irrespective
of the means by which it is induced, results in unopposed effector T-cell
responses and IBD, driven by reactivity to antigens of the microbiota.
Indeed, much of our understanding of the inflammatory potential of
the commensal microbiota has been garnered through models in which
regulatory pathways have been disrupted. The largely non-overlapping
TCR specificities of regulatory and effector T-cell subsets in the colonic
lamina propria and associated lymphoid tissues suggest that distinct
antigenic epitopes, whether derived from the same or distinct members
of the microbiota, control these competing T-cell fates.

Regulatory CD4+ T cells include subsets that are distinguished on the
basis of their expression of the transcription factor Foxp3. Foxp3+ (Treg)
and Foxp3− (T regulatory 1 (Tr1)) cells are each characterized by their
production of IL-10, one of the main immunoregulatory cytokines
required for immune tolerance of the intestinal microbiota. The non-redundant
role of IL-10 in intestinal immune homeostasis is well
established; IL-10-deficient mice develop spontaneous, unremitting
colonic inflammation driven by IL-23 and the TH17 pathway. The
essential role of TLR in sensing the microbiota in this process is evident
from the absence of disease in germ-free IL-10-deficient mice and mice
deficient for both IL-10 and MyD88 (refs 58, 59). Although non-T cells
produce IL-10, T-cell-derived IL-10 is crucial for intestinal immune
homeostasis. Thus, IL-10 deficiency that is restricted to total CD4+ T
cells results in spontaneous colitis that has a severity comparable with
that in mice with global loss of IL-10 (ref. 60). Although deficiency of
IL-10 that is limited to Foxp3+ Treg cells also induces colitis, its severity
is reduced, indicating that Foxp3− CD4+ T cells contribute to the
protective IL-10 response.

Myeloid cells of the innate immune system are the main targets for 
regulatory actions of IL-10 (ref. 61), although IL-10 signalling in CD4+
T cells also seems to be contributory. Importantly, polymorphisms
in the IL-10 locus confer susceptibility to IBD in humans, and early-onset
IBD has been linked to mutations in IL-10 receptor components
that impair signalling.

Remarkably, IL-10-producing T cells can be induced to develop 
in response to specific commensals or their products. The capsular 
polysaccharide-A moiety of the common commensal Bacteroides 
fragilis mediates interaction of B. fragilis with the colonic mucosa, 
facilitating both colonization and activation of an anti-inflammatory 
cascade. Polysaccharide-A promotes expansion of colonic IL-10-expressing
Foxp3+ Treg cells, through the TLR2-MyD88 pathway. The
expression of TLR2 on CD4+ T cells suggests there is the capacity for
direct regulation by microbial products that favours polysaccharide-A-dependent
Treg-cell responses over TH1 responses, although there
are other known TLR2 ligands that do not promote induction of Foxp3 
or IL-10 (ref. 66). 

A cocktail of Clostridium species, composed mostly of clusters IV
and XIVa, drives the efficient expansion of Treg cells in the colons of
germ-free mice, seemingly owing to their superior ability to elicit TGFβ
production in the intestinal mucosa. In contrast with induction of
Treg cells by B. fragilis, Clostridium-dependent Treg expansion occurs
independently of MyD88 through mechanisms yet to be defined. To
date, there are no studies using otherwise complete microbiota that are
devoid of B. fragilis or this specific collection of Clostridium spp., leaving
open the issue of the level of redundancy in this process. However,
B. fragilis is absent from the microbiota of 20-30% of humans, indicating
the existence of other pathways to Treg-cell development. Accordingly,
default induction or expansion of Treg cells can be achieved by
colonization of germ-free mice with a defined altered Schaedler flora
(ASF) that consists of eight species of the microbiota dominated by
Bacteroides distasonis, but devoid of B. fragilis. Thus, although there
seems to be a proclivity of certain components of the microbiota to promote
Treg-cell development, this does not seem to be a unique property
of any one microbial component, reflecting substantial redundancy in
the system to ensure that Treg-cell dominance is established.


Effector T-cell responses to the microbiota 

Similar to Treg cells and B cells, the largest deployment of effector CD4+
T cells in the healthy immune system is in the intestines, and is largely
driven by the microbiota. Although the intestines mount robust TH2
responses to helminth infestations, in countries where these infections
are less common the main effector T-cell subsets resident in the gut
express cytokines characteristic of TH17 and TH1 cells.

TH17 cells in particular, which are thought to be the most ancient of
effector T-cell subsets, seem to have evolved to bolster mucosal barrier
defences to promote mutualism with the microbiota. TH17 cells share
developmental requirements for RORγt and AhR (ref. 70) with intestinal
ILCs, as well as a similar effector cytokine repertoire, including
IL-17A, IL-17F and IL-22. Furthermore, TH17 cells have developmental
ties with Treg cells through their mutual requirement for the abundant
intestinal cytokine TGFβ (ref. 4). Finally, the TH17 developmental
pathway is distinguished by considerable plasticity, allowing divergent
functional programs contingent on local pro- or anti-inflammatory
cues. Although a last line of defence, effector CD4+ T cells are perpetually
engaged in the host-microbiota dialogue by favouring regulatory
benefits at homeostasis or pathological consequences when dysregulated.

The contribution of the microbiota to intestinal TH17-cell development
is highlighted by the virtual absence of this subset in germ-free
mice. In addition to IL-6, which is required for TH17 differentiation
and its deviation away from induced Treg programming, multiple
microbiota-dependent factors favour TH17 development in the intestines,
including TGFβ, IL-1 (ref. 73), IL-23 (ref. 74) and even ATP
derived from commensal bacteria. Similar to the ability of a limited
quorum of commensal bacterial species to disproportionately influence
induced Treg-cell development, minor constituents of the microbiota
can amplify intestinal TH17 cell numbers. In mice, an unusually potent,
but not unique, inducer of TH17 cells is the Clostridia sp. Candidatus
Arthromitus, or segmented filamentous bacteria (SFB). This bacterium
populates the ileum and caecum and has long been known to be
a potent activator of intestinal immune responses. Induction of TH17
cells by SFB provides protection against gut pathogens, suggesting
that general amplification of TH17 effector cells can be host-protective.
However, TH17 induction by SFB is not entirely benign, because mono-association
of mice with SFB induces TH17-mediated inflammatory
arthritis and multiple-sclerosis-like symptoms in the experimental
autoimmune encephalomyelitis (EAE) model. Remarkably, therefore,
sensing of a single constituent of the intestinal microbiota can promote
autoimmunity in extraintestinal tissues. At present, it is unclear whether
SFB, or related organisms, exert similar effects in humans.

Whether at steady state, responding to infection or chronically
inflamed as a result of IBD, the intestine contains, in addition to effectors
that express IL-17, T cells that express IFNγ or both IL-17 and IFNγ.
Of note, TH17 cells retain a high capacity for divergent cytokine expression
profiles and function, or plasticity, after their commitment to the
TH17 pathway. Specifically, TH17 cells can give rise to IFNγ producers
that resemble classical TH1 cells. This has been demonstrated in humans
and mice, and has suggested that many of the TH1 cells found in the
intestine arise from the TH17 pathway (Fig. 3). The relative contributions
of TH1 cells to immune protection or pathogenesis remain to be
elucidated, whether they are derived from the TH17 pathway or not.


Balancing regulatory and effector responses 

To balance accommodation of the microbiota and the need to mount
host defences against microbial invasion, the adaptive immune system
has evolved to direct the development of distinct CD4+ T-cell fates by
different APC subsets and the microbe-induced factors they produce.
Although Treg-cell dominance is the default at homeostasis, the shared
dependence of induced Treg- and TH17-cell development on TGFβ provides
an elegant means to alternately divert programming of naive CD4+
T cells from a homeostatic, microbe-tolerant response to an inflammatory,
microbe-clearing response contingent on the cofactors integrated
with TGFβ signalling (Fig. 3).

At homeostasis, naive CD4+ T-cell recognition of antigens derived
from the microbiota favours induced Treg-cell development by virtue of
the production, by intestinal dendritic cells, of the vitamin A metabolite
all-trans-retinoic acid. Vitamin A is not synthesized by the host so
its influence on the regulatory tone of the mucosal immune response is
predicated on adequate dietary intake. Retinoic acid is derived from
vitamin A by the sequential actions of the ubiquitous enzyme alcohol
dehydrogenase and one of three retinal-specific aldehyde dehydrogenases,
which are more restricted in their tissue distribution, including
CD103+ dendritic cells and IECs in the intestine. The potency of retinoic
acid as a cofactor for TGFβ-dependent development of induced Treg
cells (and suppression of TH17 development) and its constitutive
production by a regulatory subset of intestinal dendritic cells at steady
state reflect the robust default pathway to T-cell-mediated regulation.
Under conditions in which microbial antigens promote production of
pro-inflammatory cytokines by dendritic cells, retinoic acid is either
repressed or co-opted by pro-inflammatory pathways to override
Treg-cell induction.

In common with TH17 cells, and perhaps reflecting an aspect of
shared TGFβ-dependent programming that favours multipotency,
induced Treg cells display developmental plasticity that influences
adaptive immunity to the microbiota. When acted on by pro-inflammatory
cytokines, induced Treg cells can down-regulate Foxp3 and
up-regulate RORγt (ref. 88), and Treg cells can be converted into IL-17-
or IFNγ-expressing effectors during infection. Notably, although
Foxp3+ Treg cells may be diverted to effector responses, the reverse is
less apparent; established effector T cells, including TH17 cells, seem
to be more resistant to reprogramming to become Foxp3+ regulatory
cells. Accordingly, microbial antigens that are identified as products of
a pathogen are likely to be remembered as such through their imprinting
of an effector T-cell response. By contrast, microbial antigens that
promote induced Treg responses at homeostasis may, in the context of
an inflammatory response, reprogram induced Treg cells to generate
an effector T-cell response that breaks the ‘tolerance’ to that microbe.

In a further example of the developmental plasticity of Treg cells,
recent studies report their role in IgA class-switch recombination in the
gut, representing another arm of Treg-cell function in promoting immune
homeostasis to the microbiota. TGFβ has long been recognized as the
principal switch factor for development of IgA-producing B cells. In a
new twist on TGFβ’s role in this process, induced Treg cells were found
to down-modulate Foxp3 and acquire features of follicular helper T cells
that promoted IgA production in Peyer’s patches. Depletion of Treg
cells also caused a rapid loss of IgA+ plasma cells and sIgA production in
the intestine, indicating that Treg cells participate both in the development
and maintenance of intestinal IgA+ B cells. Although known to be
important in host defence against infections, the main role of sIgA is to
establish mutualism with the intestinal microbiota, as evidenced by its
marked depletion in germ-free mice and the systemic IgG response to the
microbiota in animals that specifically lack sIgA. Coupled with the fact
that about 75% of sIgA reactive to the microbiota develops through the
T-cell-dependent pathway, this suggests that sIgA is mainly regulated
by intestinal Treg cells. In addition to providing evidence of further
plasticity in the Treg developmental program, these findings identify
an additional link between adaptive immune networks that cope with
the commensal flora, and extend the long-established role of TGFβ in
promoting host-microbiota mutualism.


Dysregulated immunity to the microbiota 

Owing to their defined antigenic specificity and durable memory, effector
CD4+ T cells are a liability when inappropriately directed against
self-antigens or, in the case of the microbiota, the ‘extended’ self. In
view of the enormous number of antigens expressed in the intestinal 
metagenome, it is remarkable that dysregulated effector responses to 
the microbiota are the exception. When they do occur, the result is 
IBD, including Crohn’s disease or ulcerative colitis, which have distinct 
clinical and histopathological features, but a common requirement for
the intestinal microbiota. Similar to autoimmunity directed against 
host self-antigens, dysregulated CD4+ T-cell responses to antigens of
the microbiota lead to chronic, typically relapsing and remitting disease
that reflects the immune system's inability to eliminate the antigens that 
drive the abnormal response. 

That being said, inflammatory immune responses in the intestines
result in detectable alterations of the resident microbiota, irrespective of
the cause. This presents a challenge in our attempts to understand how
certain microbes positively or negatively affect the disease process in IBD
because it is difficult to draw conclusions without knowing the
composition of the microbiota before diagnosis. As such, our understanding
of the antigenic components of the microbiota that are the targets of
immune attack in IBD is limited. However, experimental mouse models
and patients with Crohn’s disease show a remarkably shared reactivity
to specific flagellin epitopes expressed by Clostridium spp., which seem
to be immunodominant despite their minor representation within the
commensal flora. Which attributes of these commensals make them
particularly common targets in a large subset of patients with Crohn’s
disease, but not ulcerative colitis, is unknown. However, it is likely
to reflect unique properties that place them in specific geographical
niches in the microbiota or in intimate contact with the immune system
where their expression of flagella, which is less typical for commensal
bacteria, makes them unusually proficient at inducing effector rather than
regulatory responses. Irrespective of the basis, retention of these
organisms in the collective despite the pathogenic risk they pose implies there
is a benefit to host-microbiota mutualism that has been evolutionarily
conserved and remains to be understood.

In view of the prominent role of the TH17 pathway in adaptive
immunity to the intestinal microbiota and propensity for generating
inflammation promoted by both IL-17 and IFN-γ, it is not surprising that
this pathway has emerged as a major contributor to IBD pathogenesis.
Prior to the discovery of TH17, Crohn’s disease and ulcerative colitis were
viewed in the context of TH1- and TH2-centric mechanisms, respectively;
but, increasingly, data from genome-wide association studies implicate
the contributions of the TH17 pathway in both diseases. The discovery
of IL-23 revolutionized views on the immunopathogenesis of
autoimmunity and led to discovery of TH17 cells. It is therefore fitting that
this was the first cytokine linked to IBD by genome-wide association
studies; variants of its receptor (IL-23R) have been found to confer both
protection and susceptibility to Crohn’s disease and ulcerative colitis.

As the number of genome-wide association and next-generation
sequencing studies has proliferated, so has the number of genes linked
to the TH17 pathway in IBD (ref. 95). This has been supplemented by
genes of innate and adaptive immune pathways that are integrated with
the TH17 response and control it, spanning the gamut from epithelial
barrier integrity maintenance and restitution to microbial sensing, to
immunomodulatory cytokines. Because individual susceptibility alleles
typically confer minor risk, multiple genetic susceptibility alleles are
generally required for disease. Efforts to define this clustering of risk
alleles are increasingly defining genotypes that confer substantial risk,
as well as identifying interlaced susceptibility pathways that identify
immunopathogenetic subsets of IBD, including those that do not
directly implicate the TH17 pathway. However, as more IBD
susceptibility genotypes are found to overlap with extraintestinal
immune-mediated diseases that do share a TH17 pathogenesis, they increasingly
implicate a role for the microbiota in conferring disease risk beyond the
confines of the gut. As we advance into the era of integrated
metagenomics of the human-microbiota ‘superorganism’, the opportunities for
personalized medicine that were originally envisioned from sequencing
of the human genome may well multiply in proportion to the increase
in genes contributed by our microbiota, with attendant opportunities
for more specifically targeted treatments and prevention of diseases that
have their origins in the merger of vertebrates and their microbiomes
millions of years ago.
Functional interactions between the gut
microbiota and host metabolism


Valentina Tremaroli & Fredrik Bäckhed


The link between the microbes in the human gut and the development of obesity, cardiovascular disease and metabolic 
syndromes, such as type 2 diabetes, is becoming clearer. However, because of the complexity of the microbial community, 
the functional connections are less well understood. Studies in both mice and humans are helping to show what effect the 
gut microbiota has on host metabolism by improving energy yield from food and modulating dietary or host-derived
compounds that alter host metabolic pathways. Through increased knowledge of the mechanisms involved in the
interactions between the microbiota and its host, we will be in a better position to develop treatments for metabolic disease.


Changes to lifestyle and an increase in the availability of
energy-rich foods are important contributors to the worldwide
obesity epidemic. The microbial inhabitants of the gut
can also have an influence on metabolic processes, such as energy
extraction from food, and should be considered an environmental
factor that contributes to obesity and its comorbidities (such as
insulin resistance, diabetes and cardiovascular disease).

Culture-independent methods to study microbial communities
have advanced our knowledge of the human gut microbiota
(Box 1) (see page 250 of this issue). Profiling of the common proxy
for this community, the faecal microbiota, by 16S ribosomal RNA
surveys and by direct sequencing of genetic material has shown that
the human gut microbiota is a complex community of 100 trillion
archaeal and bacterial cells distributed over more than 1,000 species
(Box 2). The community is dominated by bacteria, with more than
90% of the species belonging to Firmicutes and Bacteroidetes. Each
person has a distinct and highly variable microbiota, but a conserved
set of gut colonizers (the core gut microbiota) and genes (the core
microbiome) are shared among individuals and may be required
for the correct functioning of the gut.

Germ-free mice are those born and reared without exposure to
any live microbes, and they provide a powerful tool for understanding
the effects of the gut microbiota on host physiology. These mice
can be colonized either with selected microbial species or whole
communities from mice or humans to examine the transmissibility
of physiological and pathological phenotypes, and to test what role
the microbiota has in a particular phenotype. The gut microbiota
in these mice modulates bone-mass density and promotes fat storage,
intestinal angiogenesis and the development of an immune
response (see page 231 of this issue). In this Review, we discuss the
metagenomic and gnotobiotic-based evidence for the role of the gut
microbiota in energy metabolism and the possible links with obesity.


Obesity 

Gut microbiota composition is altered in people who are obese,
and it can respond to changes in body weight. Genetically obese
ob/ob mice are hyperphagic as a result of a mutation in the gene
that encodes the satiety-promoting hormone leptin. The caecal
microbiota of these mice contains more Firmicutes and fewer
Bacteroidetes than that of their lean wild-type littermates, even when
the mice are fed the same low-fat, polysaccharide-rich diet. Similar
changes have also been seen in the faecal microbiota of humans who
are obese. Bacteroidetes levels increase when weight is reduced,
either by fat- or carbohydrate-restricted diets, suggesting that
Bacteroidetes may be responsive to calorie intake. A similar effect
has also been observed in people who lost weight after a Roux-en-Y
gastric bypass procedure. In these patients, increased levels of
Bacteroides and Prevotella were negatively correlated with energy
intake and adiposity. Other studies showed no such shift in the
Firmicutes-to-Bacteroidetes ratio, but this may be because they
used different clinical criteria (such as level of obesity, age, degree
of weight loss and duration of calorie restriction), geographical
locations, population sizes and microbiota-profiling methodologies.
Although obesity and energy intake can affect the microbial
composition, whether the gut microbiota contributes to obesity in
humans is unclear.
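
The ratio of Firmicutes to Bacteroidetes discussed above can be made concrete with a small calculation. The following Python sketch is purely illustrative: the function names and the read counts are hypothetical and are not taken from any of the studies cited here. It assumes that a profiling step (for example, a 16S ribosomal RNA survey) has already produced per-phylum read counts for each sample.

```python
from typing import Mapping


def relative_abundance(counts: Mapping[str, int]) -> dict[str, float]:
    """Convert raw per-phylum read counts into fractions that sum to 1."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("counts must contain at least one read")
    return {phylum: n / total for phylum, n in counts.items()}


def firmicutes_bacteroidetes_ratio(counts: Mapping[str, int]) -> float:
    """Return Firmicutes reads divided by Bacteroidetes reads."""
    bacteroidetes = counts.get("Bacteroidetes", 0)
    if bacteroidetes == 0:
        raise ValueError("ratio is undefined without Bacteroidetes reads")
    return counts.get("Firmicutes", 0) / bacteroidetes


# Hypothetical read counts, invented purely for illustration.
lean_sample = {"Firmicutes": 500, "Bacteroidetes": 400, "Proteobacteria": 100}
obese_sample = {"Firmicutes": 700, "Bacteroidetes": 200, "Proteobacteria": 100}

if __name__ == "__main__":
    for name, sample in (("lean", lean_sample), ("obese", obese_sample)):
        print(name, relative_abundance(sample), firmicutes_bacteroidetes_ratio(sample))
```

Because such a ratio depends on sequencing depth, primer choice and the profiling method, values from different studies are not directly comparable, which is consistent with the methodological caveats noted above.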

A gastric bypass promotes sustained weight reduction and diminishes
the risk of diabetes and cardiovascular disease for people who
are obese. This knowledge has allowed the relationship between
microbiota and obesity to be explored further. After a gastric bypass,
diabetes can resolve before patients begin to lose weight, suggesting
that this type of surgery has a direct antidiabetic effect. Exactly how
this happens is not clear, but a shift in the composition of the faecal
microbiota of humans suggests the gut microbiota contributes
to the improved metabolic phenotype after a gastric bypass. The
beneficial microbe Faecalibacterium prausnitzii, in particular, is
less abundant in patients who are obese and diabetic, but increases
after surgery. Levels of F. prausnitzii are negatively correlated with
inflammatory markers, indicating that the bacterium may modulate
systemic inflammation (common to diabetes and obesity) and
contribute to the amelioration of diabetes. In addition, germ-free mice
do not develop diet-induced obesity, and treatment of obese mice
with antibiotics reduces adiposity and adipose inflammation, and
improves glucose metabolism, further supporting the benefits
of inducing a shift in microbiota composition.


Energy harvest 

Carbohydrates are important sources of energy for human and
microbial cells. Human enzymes cannot degrade most complex
carbohydrates and plant polysaccharides. Instead, the non-digestible
carbohydrates, including cellulose, xylans, resistant starch and
inulin, are fermented in the colon by its microbiota to yield energy
for microbial growth and end products such as short-chain fatty
acids (SCFAs) (Fig. 1), mainly acetate, propionate and butyrate.
SCFAs have profound effects on gut health, acting, for example, as an
energy source, an inflammation modulator and a vasodilator, and
contributing to gut motility and wound healing. In addition, SCFAs are
energy substrates for the colonic epithelium (butyrate) and peripheral
tissues (acetate and propionate). The patterns of intestinal
fermentation, and consequently the types and amount of SCFAs produced,
are determined by how much carbohydrate is consumed and the
composition of the gut microbiota. For example, fermentation of
dietary fructans increases when gnotobiotic mice that have been
colonized with Bacteroides thetaiotaomicron are co-colonized with
Methanobrevibacter smithii. B. thetaiotaomicron produces more
acetate and formate, and M. smithii uses formate for methanogenesis.
The interactions promote more efficient carbohydrate fermentation
and increased energy absorption from the gut, resulting in increased
adiposity in the co-colonized mice compared with mice colonized
with only B. thetaiotaomicron. The composition of the gut microbiota
and the metabolic interactions between its species may therefore
affect food digestion and energy harvest.

Direct evidence for the role of the microbiota in energy harvest
and fat deposition comes from germ-free rats, which have reduced
intestinal levels of SCFAs and excrete twice as many calories in
urine and faeces as conventional rats fed the same
polysaccharide-rich diet. The germ-free rodents compensate
for the reduced energy harvest by increasing their food intake.
Germ-free mice also have reduced adiposity compared with their
conventional counterparts, but adiposity is normalized when they
are colonized with a healthy microbiota for 14 days. Microbial
energy harvest in obesity has been investigated in conventional
genetically obese ob/ob mice, which have increased amounts of
SCFAs in their caecum and reduced energy content in their faeces
compared with their lean littermates. Metagenomic sequencing of
the caecal microbiota showed an enrichment of gene functions that
were related to the degradation of dietary polysaccharides in the
microbiome of ob/ob mice. This finding was also true of humans:
the faecal microbiota of people who are obese has an increased
capacity to harvest energy. In mice, the obese phenotype was
transmissible through microbiota transplants, and germ-free mice
colonized with the microbiota from obese donors gained twice as
much fat as those colonized with the microbiota from lean donors.

The role of the gut microbiota in promoting energy harvest from
diet and fat deposition has been clearly demonstrated in mice, but
most of the evidence in humans has come from indirect studies.
For instance, people who are obese have higher levels of ethanol in
their breath than lean people, indicating altered fermentation, and
greater amounts of faecal SCFAs, which may suggest increased
microbial energy harvest.


Diet alters the gut microbiota 

Diet is known to modulate the composition of the gut microbiota
in humans and mice. Long-term dietary habits have a considerable
effect on the human gut microbiota. For example, children in
a rural African village, who consumed high amounts of plant
polysaccharides, had low levels of Firmicutes and increased levels of
Bacteroidetes (mainly Prevotella and Xylanibacter) in their faecal
microbiota compared with Italian children, who had high levels of
Enterobacteriaceae (mainly Shigella and Escherichia). Prevotella
and Xylanibacter are known to degrade cellulose and xylans, and
are associated with increased faecal SCFAs, suggesting that the gut
microbiota of the children living in rural Africa had adapted to
maximize energy extraction from a diet rich in fibre. Human gut
microbiota can be divided into three discrete compositions, or
enterotypes. However, this
concept is currently being challenged as enterotypes may be more of




BOX 1
Terminology

• Enterotype is the grouping of the microbiota of a given person
into discrete configurations. But recent data have conflicted
with this definition and suggest that enterotypes may be
less discrete.

• Gnotobiotics is the study of animals living in a
microbiologically defined environment, either germ-free or
colonized with known bacteria.

• Inflammasomes are protein complexes containing
an intracellular sensor (such as a nucleotide-binding
oligomerization domain (NOD)-like receptor), the procaspase-1
precursor and the ASC (apoptosis-associated speck-like protein
containing a caspase activation and recruitment domain)
adaptor protein. These complexes recognize microbe- and
host-derived inflammatory signals (microbial-associated
molecular patterns and damage-associated molecular patterns),
and their activation leads to the maturation of inflammatory
cytokines (such as interleukin-1β and interleukin-18).
Inflammasomes participate in antimicrobial innate immune
responses, but may also have a role in metabolic diseases, such
as obesity, type 2 diabetes and atherosclerosis.

• Metagenome is the total DNA that can be extracted from
an environment. The human metagenome is the aggregate
of the DNA of the host and the microbiota. Metagenome and
microbiome are often used interchangeably.

• Metagenomics is the study of the metagenome (microbiome).
Metagenomics can either be targeted (usually 16S ribosomal
RNA) or untargeted (shotgun sequencing).

• Microbiota is the collective microbial community inhabiting
a specific environment. Cellular density increases along
the length of the gut, and the colonic microbiota is the
densest and most diverse community in the gut, and in the
whole human body.

• Microbiome is the collective genomic content of a microbiota.
It also indicates the total genetic capacity of the community.

• Probiotics are defined as live microorganisms that, when
administered in adequate amounts, confer a health benefit
for the host. Many bacterial strains in the Lactobacillus and
Bifidobacterium genera are considered to be probiotic.

• Prebiotics are non-digestible food ingredients that, when
consumed in sufficient amounts, selectively stimulate the
growth, activity or both of one or a limited number of microbial
genera or species in the gut microbiota that confer(s) health
benefits to the host. Inulin and trans-galacto-oligosaccharides
are defined as prebiotics because they are resistant to gastric
digestion and hydrolysis by human enzymes; are fermented by
specific members of the gut microbiota; and induce selective
growth, activity or both of beneficial intestinal bacteria. Both
inulin and trans-galacto-oligosaccharides stimulate the growth
of Bifidobacterium, an effect defined as bifidogenic.


a gradient than discrete entities. Each enterotype is dominated by a
different genus (Bacteroides, Prevotella or Ruminococcus), but is
not affected by gender, age or nationality. Enterotypes dominated by
Bacteroides or Prevotella are associated with the consumption of a diet
rich in protein and animal fat, or carbohydrates, respectively. The
Ruminococcus enterotype is not well separated and is partly merged
with the Bacteroides enterotype. This division supports the
association between Prevotella and a diet high in carbohydrates, which was
seen in children from rural Africa. A 10-day dietary intervention,
however, was not sufficient to alter the enterotype of an individual,
BOX 2
Dominant microbes


The human gut microbiota is dominated by five bacterial phyla
(Firmicutes, Bacteroidetes, Actinobacteria, Proteobacteria and
Verrucomicrobia) and one archaeal phylum (Euryarchaeota). The less
prevalent bacterial groups are distributed among Cyanobacteria,
Fusobacteria, Lentisphaerae, Spirochaetes and TM7.

The Firmicutes phylum contains relevant genera, including 
Ruminococcus, Clostridium, Lactobacillus (several strains of 
which are probiotics), and the butyrate producers Eubacterium, 
Faecalibacterium and Roseburia. 

In Bacteroidetes, Bacteroides, Prevotella and Xylanibacter 
degrade a variety of complex glycans. 

The Actinobacteria phylum includes Collinsella and 
Bifidobacterium (which contains probiotic strains). Common 
Proteobacteria are Escherichia (from the Enterobacteriaceae 
family) and Desulfovibrio (which contains sulphate-reducing 
bacteria). Verrucomicrobia was recently discovered and includes 
Akkermansia (which are specialized for mucus degradation). 
Euryarchaeota contains the prevalent Methanobrevibacter (which 
is involved in the continuation of intestinal methanogenesis). 


suggesting that a long-term change may be required to provoke a
major shift in gut microbiota composition.

Changes in daily carbohydrate intake may affect specific groups of
colonic bacteria over a short period of time. Consumption of the
prebiotic inulin increases the levels of F. prausnitzii and Bifidobacterium sp.
in humans. Similarly, prebiotics promote a selective increase in
Bifidobacterium sp. in diet-induced obese mice, and this increase
is correlated with reduced adiposity and levels of microbe-derived
inflammatory molecules, such as lipopolysaccharide, compared with
mice that are fed a high-fat diet without prebiotics. Humans whose diets
are supplemented with resistant starch have increased faecal levels of
Ruminococcus bromii and Eubacterium rectale, which correlates with
fibre fermentation. Consumption of resistant starch also improves
insulin sensitivity, but the variation in the microbial response to
changes in resistant starch between individuals suggests successful
dietary interventions need to be personalized.

The gut microbiota also reacts to dietary fat. Mice fed on high-fat
diets have reduced numbers of Bacteroidetes, and increased numbers
of Firmicutes and Proteobacteria. This change is rapid, occurring
within 24 hours. Transplantation of the caecal microbiota from
obese mice fed on high-fat diets into germ-free recipients increases
adiposity significantly more than transplantation of a lean
microbiota. The altered microbial community of obese mice seems to
have some role in promoting diet-induced obesity, but the
mechanisms that cause this are unknown. A change in diet clearly alters
the gut microbiota, and these alterations may contribute to the host’s
metabolic phenotype. Further metatranscriptomic and proteomic
studies should provide insight into how microbial function
responds to a dietary shift.


Microbial processing of food constituents 

Products of microbial metabolism act as signalling molecules and 
influence the host’s metabolism. Microbial products directly affect 
intestinal function but may also affect the liver and brain, as well 
as adipose and muscle tissue, which consequently may affect the 
level of obesity and the associated comorbidities (Fig. 2). Micro- 
bial enzymatic activities can act directly on the fermentation of 
polysaccharides and bile-acid metabolism, or act in conjunction 
with the host on the metabolism of choline (Fig. 3). 


Fermentation of polysaccharides 

Non-digestible carbohydrates are important sources of energy
for several members of the colonic microbiota. Species such
as B. thetaiotaomicron and Bacteroides ovatus contain more than
twice as many glycosidase and lyase genes as the human
genome and are capable of using nearly all of the main plant and
host glycans (such as mucus-associated glycoproteins). Of the
SCFAs produced from microbial fermentation, butyrate is
particularly important as an energy substrate for cellular metabolism in the
colonic epithelium. The colonic epithelial cells of germ-free mice are
severely energy-deprived and are characterized by increased
activation of AMP-activated protein kinase (AMPK), which senses cellular
energy status. This is also true of the liver of germ-free mice. The
liver metabolism of germ-free and colonized mice differs
considerably, possibly because of the increased influx of SCFAs into the liver
of colonized mice (Fig. 1). Acetate and propionate are taken up by
the liver and used as substrates for lipogenesis and
gluconeogenesis. Colonized mice have higher levels of stored triglycerides in the
liver and an increase in the synthesis of very-low-density
lipoproteins, which transport triglycerides from the liver to other tissues.
Increased triglyceride production in the liver of colonized mice is
associated with reduced expression of fasting-induced adipose
factor, or ANGPTL4, in the small intestine. ANGPTL4 is a potent
inhibitor of the enzyme lipoprotein lipase, which mediates cellular
uptake of triglycerides. Germ-free Angptl4-deficient mice gained as
much fat mass and body weight during high-fat feeding as colonized
mice, indicating that ANGPTL4 directly mediates microbial
regulation of adiposity in mice.

SCFAs also affect proliferation, differentiation and modulation of
gene expression in mammalian colonic epithelial cells. However,
these effects have been attributed to butyrate acting as a potent
histone deacetylase inhibitor and, as such, it may regulate 2% of the


Figure 1 | Effects of colonic fermentation of dietary
fibres. Complex carbohydrates, such as dietary
fibre, are metabolized by the colonic microbiota to
oligosaccharides and monosaccharides and then 
fermented to short-chain fatty acid end-products, 
mainly acetate, propionate and butyrate. Short-chain 
fatty acids are absorbed in the colon, where butyrate 
provides energy for colonic epithelial cells, and acetate 
and propionate reach the liver and peripheral organs, 
where they are substrates for gluconeogenesis and 
lipogenesis. In addition to being energy sources, 
short-chain fatty acids control colonic gene expression 
by inhibiting the enzyme histone deacetylase (HDAC) 
and metabolic regulation by signalling through 
G-protein-coupled receptors (GPCRs), such as GPR41 
or GPR43. 
mammalian transcriptome. In addition, SCFAs can regulate gene
expression by binding to the G-protein-coupled receptors (GPCRs)
GPR41 (also known as FFAR3) and GPR43 (also known as FFAR2).
Signalling through these receptors affects several different functions
depending on the cellular type. For example, SCFAs suppress
inflammation through GPR43 signalling in immune cells, such as
neutrophils, and modulate secretion of the hormone GLP-1 (which
improves insulin secretion and has antidiabetic effects) by
enteroendocrine L-cells in the distal small intestine and colon. In
addition, the gut microbiota induces Pyy expression by L-cells through a
GPR41-dependent mechanism. Conventional Gpr41-deficient mice
have reduced adiposity compared with conventional wild-type mice,
whereas germ-free wild-type and Gpr41-deficient mice have similar
adiposity, indicating that the effect of the microbiota on fat
deposition is dependent on this SCFA receptor.

Microbial fermentation of polysaccharides may affect host adi- 
posity through several complementary mechanisms, so modulation 
of the microbiota and its fermentation capacity may provide new 
avenues for managing obesity. 


Microbial regulation of bile-acid metabolism 
The primary bile acids cholic acid and chenodeoxycholic acid are
synthesized in the human liver from cholesterol, and are important
for ensuring that cholesterol, dietary fats and fat-soluble vitamins
from the small intestine are soluble and absorbable. Primary bile
acids are conjugated to taurine in mice and to glycine in humans, and
are taken up in the distal ileum for transport to the liver. However,
bacteria in this part of the ileum deconjugate these bile acids, which
then escape intestinal uptake and can be further metabolized by the
gut microbiota into secondary bile acids. Because the gut microbiota
transforms bile acids, germ-free rodents have more bile acid and a
less diverse profile than their conventionally raised counterparts.
Bile acids also function as signalling molecules and bind to cellular
receptors, such as the bile-acid-synthesis controlling nuclear
receptor farnesoid X receptor (FXR) and the GPCR TGR5. Both FXR and
TGR5 have been implicated in the modulation of glucose
metabolism in mice, but FXR impairs, whereas TGR5 promotes, glucose
homeostasis. In contrast to FXR, which is activated by primary
bile acids, TGR5 binds secondary bile acids such as deoxycholic
acid (formed from cholic acid) and lithocholic acid (formed from
chenodeoxycholic acid). TGR5 signalling in enteroendocrine L-cells
induces secretion of GLP-1, thereby improving liver and pancreatic
function and enhancing glucose tolerance in obese mice. Bile acids
are taken up from the gut and circulated throughout the body, so
activation of TGR5 and FXR in peripheral organs may contribute
to overall host metabolism. Activation of TGR5 in brown adipose
tissue and muscle increases energy expenditure and protects against
diet-induced obesity. The gut microbiota may therefore contribute
to the level of obesity and type 2 diabetes by controlling lipid and
glucose metabolism through the composition of bile-acid pools and
the modulation of FXR and TGR5 signalling.


Microbial metabolism of choline 

Choline is an important component of cell membranes and is mostly
obtained from foods such as red meat and eggs, but may also be
synthesized by the host. Choline is also important for lipid metabolism and
synthesis of very-low-density lipoprotein in the liver, and insufficient
levels in the diet are associated with altered gut microbial ecology and
liver steatosis in mice and humans. In particular, low quantities of
Gammaproteobacteria and high levels of Erysipelotrichi in human
faecal microbiota are associated with hepatic steatosis. Microbial
and host enzymatic activities interact in the transformation of choline
into toxic methylamines: trimethylamine that is produced by intestinal
microbes can be further metabolized to trimethylamine-N-oxide in
the liver. These transformations may decrease the levels of
bioavailable choline and are suggested to trigger non-alcoholic fatty liver
Figure 2 | Features of the gut microbiota that promote obesity and insulin
resistance. Alterations to the composition and metabolic capacity of gut
microbiota in obesity promote adiposity and influence metabolic processes
in peripheral organs, such as the control of satiety in the brain; the release
of hormones from the gut (shown as PYY and GLP-1); and the synthesis,
storage or metabolism of lipids in the adipose tissue, liver and muscle.
Microbial molecules also increase intestinal permeability, leading to systemic
inflammation and insulin resistance.


disease (NAFLD) in mice. An altered gut microbial composition
and its capacity to metabolize choline may have an important role in
modulating NAFLD as well as glucose homeostasis.

Furthermore, plasma levels of trimethylamine-N-oxide and its
metabolites are correlated with cardiovascular disease (Fig. 3). The
effect of microbial choline metabolism in cardiovascular disease is
shown by the reduction of atherosclerosis in atherosclerosis-prone
Apoe−/− mice treated with broad-spectrum antibiotics. A gut
microbiome with different capacities to process cholesterol and choline
may contribute to the development of cardiovascular diseases.


Regulation of permeability and inflammation 

Obesity, insulin resistance and development of type 2 diabetes are
associated with systemic and adipose tissue inflammation. The gut
microbiota is a rich source of molecules such as lipopolysaccharide
and peptidoglycan that may cause inflammation in peripheral tissues
of the body. Colonization of germ-free mice with Escherichia coli is
sufficient to augment macrophage infiltration of adipose tissue and
polarize macrophages towards the expression of pro-inflammatory
cytokines. Plasma lipopolysaccharide levels increase in patients
with type 2 diabetes, and feeding lipopolysaccharide to mice for 4
weeks increases adipose tissue inflammation and reduces insulin
sensitivity. These findings suggest that the gut microbiota may affect
host metabolism by altering adipose tissue inflammation. Higher
numbers of T cells and mast cells, and lower numbers of
regulatory T cells are also involved, but if and how the gut microbiota
affects these cells and whether such interactions contribute to
metabolic abnormalities is unclear.

Plasma lipopolysaccharide levels seem to rise with higher fat 
intake in mice and humans™”. Two hypotheses have been made 
to explain the mechanism: lipopolysaccharide is taken up with 
dietary fats in chylomicrons”, or lipopolysaccharide reaches the 
circulation because the gut is more permeable in obese mice”. 
A connection between metabolism and the function of the epi- 
thelial barrier is thought to exist. Targeted deletion of fatty acid 
synthase — encoded by the Fas gene — in the gut epithelium of mice 
showed that epithelial de novo lipogenesis is required to maintain 
barrier function. Fas-deficient epithelium has increased permeability
and, as a result, increased colonic levels of proinflammatory
cytokines and high serum lipopolysaccharide. These phenotypes 
were corrected by antibiotic treatment, suggesting a reciprocal 
interaction between microbiota, altered epithelial permeability and 
host metabolism. 

A similar connection between gut permeability and type 2 diabetes
in humans could also be present. Permeability is correlated with
increased visceral adiposity and hepatic steatosis, and those with
high visceral adiposity and type 2 diabetes have increased levels of
bacterial DNA in their blood. However, inflammation may itself increase
permeability in the gut, so further investigation is needed into whether
increased permeability causes adipose inflammation or increased
inflammation contributes to increased permeability.
Either way, the gut microbiota modulates permeability, which may
contribute to adipose inflammation and cause insulin resistance.
Figure 3 | Diet-independent and -dependent microbial effects on host
metabolism. The gut microbiota produces pro-inflammatory molecules,
such as lipopolysaccharide and peptidoglycan, which may affect host
metabolism through proteins produced by the host to mediate the
immune response. Choline, cholesterol and polysaccharides obtained
from the diet are metabolized by the gut microbiota and either directly
or through further host-microbial co-metabolization generate bioactive


Lipopolysaccharide molecules bind to Toll-like receptor 4 (TLR4), 
and peptidoglycan to nucleotide-binding oligomerization domain 
(NOD) receptors, both of which activate proinflammatory signalling
cascades. Deletion of TLR4 in haematopoietic cells by generating
bone-marrow chimaeras shows that TLR4 activation in macrophages
of mice fed a high-fat diet is required for the development of fasting
hyperinsulinaemia and insulin resistance in liver and adipose tissue,
but not for the development of obesity. The innate immune system,
however, also modulates microbial composition, which may have 
autonomous effects on host metabolism. Mice deficient in TLR5 have 
an altered microbial ecology and exhibit metabolic-syndrome signs, 
such as obesity, insulin resistance and dyslipidaemia, which are, in part, 
associated with increased food consumption. Transplantation of the
gut microbiota from Tlr5-deficient and wild-type mice into germ-free 
recipients shows that the phenotypes are transmissible, and suggests 
that the gut microbiota alone can mediate disease. 

Microbe-associated molecular patterns, including lipopolysaccharide 
and peptidoglycan, can be recognized by nucleotide-binding domain 
and leucine-rich-repeat-containing proteins (NLRPs), which form the 
inflammasome complex together with the apoptosis-associated speck- 
like protein containing a caspase activation and recruitment domain 
(ASC). Obesity is associated with increased adipose expression of
NLRP3 in mice, and ablation of NLRP3 enhances insulin signalling.
However, inflammasomes may be linked to gut microbiota and host
metabolism (Fig. 4). NLRP3, NLRP6 and ASC are important regulators
of microbial ecology in mice, and deletion of their genes increases the
number of Bacteroidetes (Prevotellaceae) and TM7. In particular,
deficiency in Nlrp6 results in altered gut microbial ecology that
predisposes mice to colitis, and inflammasome complexes that do not
contain NLRP3 or NLRP6 are associated with an altered gut microbiota and
promote NAFLD and non-alcoholic steatohepatitis (NASH). Importantly,
wild-type mice housed with disease-prone Asc−/− mice develop
NAFLD or NASH, providing direct evidence that an altered gut
microbiota may cause these diseases. Alterations to gut microbiota
composition are associated with an increased influx of TLR4 and TLR9 ligands
— presumably lipopolysaccharide and bacterial DNA — respectively, 
compounds. In the case of choline, this can lead to cardiovascular disease;
for cholesterol, activation of TGR5 can increase energy expenditure
and GLP-1 secretion or protection against heart disease; and for
polysaccharides, short-chain fatty acids can be used as an energy source 
or can bind to GPR41 or GPR43 to regulate hormones and modulate 
inflammation. FMO, flavin-containing monooxygenase; TLR4, Toll-like 
receptor 4; TMA, trimethylamine; TMAO, trimethylamine-N-oxide. 


Figure 4 | Different microbial innate immune mechanisms affect host 
metabolism in the gut and liver. Plasma lipopolysaccharide seems to rise 
with higher fat intake, and those with high visceral adiposity have higher levels 
of microbial DNA in their blood. Both lipopolysaccharide and microbial DNA 
seem to be connected with gut permeability. NLRP3 and 6 are both important 
regulators of microbial ecology through the effector protein interleukin-18 


to the liver through the portal vein. Mice deficient in TLR signalling in
the liver are therefore protected from developing conditions related to
metabolic syndrome such as obesity, NAFLD and NASH (Fig. 4). The
interaction between diet, host and gut microbiota may modulate gut 
permeability that leads to an influx of proinflammatory molecules and 
subsequent activation of inflammatory signalling pathways in periph- 
eral tissues that may cause obesity, steatosis and insulin resistance. 


Future research 
The gut microbiota is increasingly being accepted as an environmen- 
tal factor that affects host metabolism and contributes to associated 
pathological conditions, such as obesity, diabetes and cardiovascular 
disease. However, the contribution that the gut microbiota makes to 
causing obesity and diabetes in humans is unclear. This is probably 
because the heterogeneous aetiology of obesity and diabetes can be 
associated with different microbes; studies are underpowered and 
include participants with diverse ethnic origin and food habits; the 
composition of the gut microbiota has large interpersonal variation; 
and different methods, with specific biases, have been used to profile 
the microbiota. Cheaper sequencing and improved bioinformatics 
tools for the analysis of the gut microbiota will allow more research- 
ers to use metagenomic sequencing and avoid primer and polymerase 
chain reaction biases linked to 16S rRNA gene surveys. Although use- 
ful, metagenomic approaches should be complemented by metatran- 
scriptomics and metaproteomics to assess which microbial genes and 
proteins are expressed in specific conditions. One of the main chal- 
lenges is to obtain robust predictive biomarkers for obesity and diabetes 
on the basis of the gut microbiota, for which improved study designs 
and analytical methods are essential. Much of the focus has been on 
the faecal microbiota, but many metabolic functions also occur in the 
small intestine. Sampling small-intestinal specimens might contribute to
the identification of microbial biomarkers for health and disease, and
although challenging, more emphasis should be placed on examining
the small-intestinal microbiota and its effects on host metabolism.

Studies in humans tend to be correlative, so the role of the microbiota 
in obesity and its comorbidities in humans remains to be proven. How- 
ever, this role can be examined in animal studies. Germ-free mice can be 




(IL-18). The altered gut microbiota can stimulate CCL5 secretion, which
can result in increased permeability and influx of microbial components. In
the liver, lipopolysaccharide and bacterial DNA activate the receptors TLR4
and TLR9, leading to increased tumour-necrosis-factor-α (TNFα) secretion and
development of non-alcoholic fatty liver disease (NAFLD) and non-alcoholic 
steatohepatitis (NASH). 


‘humanized’ by colonizing them with human intestinal communities, 
providing tools for examining the function of a specific human micro- 
biota and testing how it interacts with specific diets. Genetically engi- 
neered germ-free mice could help to identify the molecular mechanisms 
by which the gut microbiota affects host metabolism. Pigs have similar 
gastrointestinal tracts and diets to humans, so they could be useful ani- 
mal models in which to test dietary interventions and to manipulate the 
gut microbiota to improve health and prevent disease. 

Accumulating evidence indicates that the gut microbiota may be a target
for treating metabolic diseases. Supplementing the diet with non-digestible
food ingredients, or prebiotics, that stimulate the expansion
of specific microbes to improve metabolic regulation can be a therapy.
Probiotics may be an interesting approach for prevention of obesity and
related diseases. But to determine the effects of both of these therapies,
double-blind, placebo-controlled studies are required. Therapies that
replace an unhealthy microbiota with a healthy one through transplantation
have been used successfully since 1958 for the treatment of
antibiotic-related diarrhoeal colitis. Recently, transplantation of healthy lean
microbiota improved insulin signalling in participants with metabolic
syndrome. Although this is a promising technique, transmission of unknown
and potentially pathogenic bacteria and viruses from an unfractionated
gut microbiota may pose risks for the recipient. Using microbiota-based
interventions to treat obesity will require probiotics that are selected for
specific clinical manifestations of metabolic syndrome.
Genomic approaches to studying
the human microbiota


George M. Weinstock


The human body is colonized by a vast array of microbes, which form communities of bacteria, viruses and microbial 
eukaryotes that are specific to each anatomical environment. Every community must be studied as a whole because many 
organisms have never been cultured independently, and this poses formidable challenges. The advent of next-generation 
DNA sequencing has allowed more sophisticated analysis and sampling of these complex systems by culture-independent 
methods. These methods are revealing differences in community structure between anatomical sites, between individuals, 
and between healthy and diseased states, and are transforming our view of human biology. 


The microbes that exist in the human body are collectively
known as the human microbiota. This amazingly complex and
poorly understood group of communities has an enormous
impact on humans. An increasing number of conditions are being 
examined for correlative and causative associations with the micro- 
biome — which, in this Review, is used to refer to the microbiota 
and the habitat it colonizes (Box 1). Each one of the many microbial 
communities has its own structure and ecosystem, depending on 
the body environment it exists in. The fundamental goal of human 
microbiome research is to measure the structure and dynamics of 
microbial communities, the relationships between their members, 
what substances are produced and consumed, the interaction with the 
host, and differences between healthy hosts and those with disease. 

Despite an explosion in human-microbiome research, these communities
are still the dark matter of the body. The microbiome has
been called another organ because of its products, its responsiveness
to the environment and its integration with other systems.
Sometimes referred to as our second genome, the genes of microbes
that make up the microbiome outnumber human genes by more
than 100-fold, with over 3 million bacterial genes in the gut alone.
These extensive microbial ecosystems are not limited to the human 
body. Microbes and their communities dominate the environment 
and occupy a vast range of niches. Environmental metagenomics was 
developed extensively before being applied to the human body,
and methods from other disciplines have had a significant effect on 
human-microbiome research. Defining complicated microbial eco- 
systems and developing tools to probe their workings is an important 
research enterprise of twenty-first century microbiology. 

The complexity of microbial communities makes studying them 
challenging. There may be hundreds of different species, and enu- 
merating what organisms are present with standard microbiological 
techniques is not possible because many organisms have never been 
grown in culture and may require special, as yet unknown, growth 
conditions. In addition, the abundance of some microbes can range 
over orders of magnitude, so deep sampling is required to detect the 
less-abundant members. Culture-independent methods of taking a 
microbial census began about 25 years ago and were based on targeted
sequencing of 5S and 16S ribosomal RNA genes, which differ
for each species and are a convenient identifier. As this became a
tractable research area, next-generation sequencing (NGS) technologies
(Table 1) were developed and allowed more extensive analyses:
both targeted 16S rRNA gene sequencing and whole-genome shotgun
sequencing of microbes in communities en masse. The number
of culture-independent metagenomic investigations of the human 
microbiome has mushroomed, and it is one of the most studied areas 
of microbiology with significant potential to benefit clinical practice. 
This culture-independent methodology is broadly applied outside 
human-microbiome research and is expanding our knowledge of 
the environment. This Review describes how NGS approaches are 
transforming human-microbiome studies, and posing questions and 
challenges for the future. 


Single organisms and microbial communities 
In the past, research on microbial interactions with humans has 
focused on single pathogenic organisms. Studies of communities 
of non-pathogenic microbes in the body were limited because the 
organisms were thought to be benign, with minor effects on human 
health compared with pathogens. Microbiome research has led to new 
interest in the communities of non-pathogenic microbes that inhabit 
the human body, and the need to describe the genomes of these organ- 
isms to understand the human microbiome has been recognized. 
Every community of the microbiome has its own characteristics 
(Table 2). For the gut community, for example, high biodiversity is 
associated with a healthy state and reduced biodiversity occurs in 
patients with conditions such as Crohn’s disease, whereas for tissues
of the vagina, a lower biodiversity exists in healthy individuals
and a bloom of organisms occurs in patients with vaginosis. To
understand why different sites have different properties, the mecha- 
nisms that lead to the disruption of ecosystems and to disease, and 
exceptions to generalities about a tissue, researchers require knowl- 
edge of the structure and behaviour of microbial communities. 
Microbial communities benefit the host by providing functions 
such as digestion of nutrients or protection against infection.
Antibiotic treatment perturbs the microbiome by reducing
its size and altering its composition. This disturbance can lead to
infection, and antibiotic-resistant organisms such as Clostridium
difficile (normally controlled by the microbiome) can overgrow
and create problems. More complex community contributions
also exist, such as interactions with host immune and inflammatory
systems or production of metabolites involving hybrid
pathways from multiple organisms, including host-microbe pathways.
Understanding these phenomena will ultimately allow the
microbiome to be manipulated so that, for example, transplants of
microbial communities could treat C. difficile infections.

Whether the microbial ecology of the human body can be sim- 
plified to the properties of single organisms is unknown. Many 
organisms have never been cultured and may be adapted to life in a
community environment rather than a pure culture. For organisms
for which growth requirements are understood, there is a dependence
on secreted products from other community members. For
example, secreted siderophores are small molecules that help
microbes to scavenge iron, which is a limiting factor for growth in
the body. So even the study of individual organisms can be dependent
on studying the community.


Dissecting a microbiome 

Analysis of community structure (Fig. 1) focuses on either targeted 
regions (such as the 16S rRNA gene) or shotgun sequencing to 
catalogue the genes that are present. Additional analysis involves 
sequencing genomes of individual organisms to produce a catalogue 
of reference genomes, and analysing RNA to describe the transcriptome
and identify RNA viruses. Non-genomic analyses include
proteomic and metabolomic studies, but these are not discussed 
here. Every sample should be well-annotated with clinical metadata, 
so that, ultimately, the microbiome’s genetic and community struc- 
tures can be correlated with the individual’s phenotype. 


Census of organisms 

Modern metagenomic analyses of microbial communities were 
developed from culture-independent methods for taking a cen- 
sus of organisms present in a community and their abundances. 
Although DNA reassociation kinetics provides information on
community diversity and structure, it does not identify the
organisms present, so they cannot be tracked between samples. Methods more
useful for providing information on the entire structure often focus
on signature sequences that distinguish taxa (detected by hybridization
to arrays of diagnostic oligonucleotides), various methods for
fingerprinting polymerase chain reaction (PCR) products (such as
single-strand conformation polymorphisms or terminal restriction
fragment length polymorphisms) or DNA sequencing of targeted
PCR products. Sequencing of 16S rRNA genes is the main method
of taking a community census because fingerprinting methods do
not adequately measure low-abundance organisms.

16S rRNA differs for each bacterial species. A bacterial species is 
hard to define, but is often taken as organisms with 16S rRNA gene 
sequences having at least 97% identity — an operational taxonomic 
unit (OTU). A 16S rRNA gene sequence of about 1.5 kilobases has 
nine short hypervariable regions that distinguish bacterial taxa; 
the sequences of one or more of these regions are targeted in a
community census. 
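
The 97% identity rule for OTUs can be illustrated with a small sketch. The code below is not a method described in this Review: it assumes that the sequences are already aligned to equal length, and it uses a simple greedy scheme in which each sequence joins the first OTU whose seed sequence it matches at the threshold. The function names `percent_identity` and `cluster_otus` are hypothetical.

```python
from typing import List, Tuple


def percent_identity(a: str, b: str) -> float:
    """Fraction of matching positions between two pre-aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and aligned to equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return matches / len(a)


def cluster_otus(seqs: List[str], threshold: float = 0.97) -> List[Tuple[str, List[str]]]:
    """Greedy clustering (an illustrative assumption, not a published pipeline).

    Each sequence joins the first OTU whose seed it matches at or above
    the threshold; otherwise it becomes the seed of a new OTU.
    """
    otus: List[Tuple[str, List[str]]] = []
    for s in seqs:
        for seed, members in otus:
            if percent_identity(s, seed) >= threshold:
                members.append(s)
                break
        else:
            otus.append((s, [s]))
    return otus
```

With 100-base toy sequences, a read that differs from a seed at two positions (98% identity) falls into the same OTU, whereas one that differs at five positions (95%) seeds a new OTU. The result of greedy clustering depends on the order of the input, which is one reason that OTU counts from different tools are not directly comparable.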

Before the introduction of NGS methods, the prevailing approach 
was to clone full-length 16S rRNA genes after PCR with primers that 
would amplify genes from a wide range of organisms. Cloned 16S 
rRNA genes were sequenced by the Sanger method, which required 
two or three reads to cover the entire gene. Accuracy was crucial 
because sequencing errors led to misclassification. The cost and 
effort required for the Sanger method limited the depth of sampling, 
and studies often produced about 100 sequences per specimen. This 
method identified the dominant organisms in a community, but 
analysis of less abundant organisms was limited. 
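
Why about 100 sequences per specimen captures only the dominant organisms can be shown with a short calculation. The sketch below assumes that each sequence is drawn independently and in proportion to a taxon's relative abundance (ignoring PCR and cloning biases), so that the chance of seeing a taxon at least once in n sequences is 1 − (1 − p)^n. The function name and the example abundance of 0.1% are illustrative assumptions.

```python
def detection_probability(relative_abundance: float, n_sequences: int) -> float:
    """Chance that at least one of n_sequences comes from the taxon.

    Assumes independent draws proportional to abundance (no PCR bias).
    """
    if not 0.0 <= relative_abundance <= 1.0:
        raise ValueError("relative_abundance must be between 0 and 1")
    if n_sequences < 0:
        raise ValueError("n_sequences must be non-negative")
    return 1.0 - (1.0 - relative_abundance) ** n_sequences


# A taxon making up 0.1% of the community (hypothetical value),
# sampled at a Sanger-era depth and at a much deeper NGS-era depth.
rare_taxon = 0.001
for depth in (100, 7000):
    print(depth, round(detection_probability(rare_taxon, depth), 3))
```

Under these assumptions, a taxon at 0.1% abundance is detected in roughly one specimen in ten at a depth of 100 sequences, but in more than 99% of specimens at a depth of several thousand.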

Introducing NGS to 16S rRNA gene analysis led to marked 
improvements in cost and depth of sampling. The Roche-454 
platform has dominated microbial community analysis. As the read
length for 454 pyrosequencing is about 400 bases, only a portion of 
the 16S rRNA gene can be sampled, and many different studies have 
targeted between one and three of the hypervariable regions, with 
different hypervariable regions targeted in different studies. Using 
a portion of the 16S rRNA gene led to a loss of sensitivity (some taxa 
BOX 1
Terminology

• Biodiversity is a measure of the complexity of a community.
It is affected by the number of taxa (richness) and their range
of abundance (evenness). High biodiversity occurs when many
taxa (high richness) are present at similar abundances (an
even distribution).
• Commensals are organisms that benefit from another organism
without harming or benefiting it. Microbes of the
microbiome were thought to be commensals that benefited from
the human host but did no harm. Many of these organisms provide
benefits to the human host and so have a mutualistic relationship.
• Contig is a stretch of contiguous sequence in a genome assembly.
• Coverage is the number of times a genome or gene is sequenced.
In a genome sequenced to 100× coverage, each nucleotide in the
sequence appears, on average, in 100 reads.
• Genome assembly is the process of constructing a genome
sequence from short subsequences by sequencing many random
fragments from a sheared genome. The random short sequences
are compared, and overlapping common sequences are used to
determine their orientation and order with respect to each other. A
consensus sequence is constructed from this layout. Usually there
are gaps, but when contigs can be arranged in the correct order and
orientation, these longer stretches are called scaffolds.
• Metagenomics was defined as a process for identifying
genes specifically by their function by cloning them directly from
the environment and expressing genes in a surrogate host.
Therefore, gene function was known even if the sequence was
not sufficient for functional inference, such as when it encoded
a protein of previously unknown function. This definition, also
known as functional metagenomics, is widely used. More recently,
metagenomics refers to general analyses of microbial communities
by culture-independent methods, which do not necessarily focus on
function. The combined genomes of the microbes in a community
are thought of as the community metagenome. Another type of
metagenomic analysis focuses on the structure of these aggregate
genomes in a community.
• Microbiome in this Review refers to the microbiota and the
habitat it colonizes and is analogous to the term biome in ecology.
Microbiome is also used to refer to the collective genomes of the
microbes (what is now called the metagenome); the term may have
originally been coined by Joshua Lederberg (cited by Hooper and Gordon).
However, it is also used for the more ecologically consistent
meaning. A microbiome can be a specific body site, such as the gut
microbiome, but the human microbiome is often used to refer to the
collection of microbiomes of the human body.
• Mutualism is a type of symbiosis in which both organisms benefit.
This is one type of relationship seen in the human microbiome.
• Operational taxonomic unit in microbiome research is a group
of organisms with 16S ribosomal RNA gene sequences that show a
certain level of identity. This group is often used as a surrogate for a
species when the 16S rRNA sequences are at least 97% identical.
• Pathogenic microbe is one with the potential to cause disease.
• Read is the primary output of DNA sequencing, consisting of a
short stretch of DNA sequence that is produced from sequencing a
region of a single DNA fragment.
• Shotgun sequencing is the process of randomly breaking (often
by shearing) a long DNA molecule (for example, a complete
chromosome) and then sequencing the resultant DNA fragments,
which each come from a different location in the original long
DNA molecule.
• Virome is the collection of viruses in the microbiota.
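
The Box 1 definition of biodiversity in terms of richness and evenness can be made concrete with a small sketch. Box 1 does not name a particular index; the code below assumes the Shannon index and Pielou's evenness, which are common choices in ecology, and the function names are hypothetical.

```python
import math
from typing import Sequence


def richness(counts: Sequence[int]) -> int:
    """Number of taxa observed at least once."""
    return sum(1 for c in counts if c > 0)


def shannon(counts: Sequence[int]) -> float:
    """Shannon diversity index H (natural log); assumed here as one common choice."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("counts must contain at least one observation")
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def pielou_evenness(counts: Sequence[int]) -> float:
    """Evenness J = H / ln(richness); 1.0 means all taxa are equally abundant."""
    s = richness(counts)
    return shannon(counts) / math.log(s) if s > 1 else 0.0
```

Two communities with the same richness can differ greatly in diversity: four taxa with counts of 25 each give an evenness of 1, whereas counts of 97, 1, 1 and 1 give the same richness but a much lower Shannon index.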
Table 1 | DNA sequencing platforms used for microbiome analysis

| Platform | Method | Characteristics | 16S rRNA | Shotgun | Comments |
| --- | --- | --- | --- | --- | --- |
| Established | | | | | |
| Sanger-based or capillary-based instrument | Fluorescent, dideoxy terminator | 750-base reads; high accuracy | Full length sequenced with 2-3 reads | Long reads help with database comparisons | Most costly method; relatively low throughput, so low coverage of 16S or shotgun |
| Roche-454 | Pyrosequencing light emission | 400-base reads | Up to 3 variable regions per read | Long reads help with database comparisons | Cost limits shotgun coverage but 16S coverage is good |
| Illumina | Fluorescent, stepwise sequencing | 100-150-base reads | Only 1 variable region per read | Short reads do not seem to limit analysis | Very high coverage owing to high instrument output and very low cost |
| Not yet widely used | | | | | |
| Ion Torrent | Proton detection | More than 200-base reads | Like other NGS | Like Illumina | Expect high coverage, but longer reads than Illumina |
| PacBio | Fluorescent, single-molecule sequencing | Up to 10-kilobase reads; low accuracy | Accuracy an issue for correct taxon identification | Long reads could help assembly | Attractive for long reads, but lower accuracy limits applications |
| Oxford Nanopore* | Electronic signal as DNA passes through pore; single-molecule sequencing | Long reads | Unknown | Long reads could help assembly | Not yet available |

*At the time of publication, the Oxford Nanopore system was not available, and information provided is based on company presentations. Ion Torrent and PacBio are both available but have not been widely
used for microbiome analysis. The Illumina MiSeq instrument is expected to provide 250-base reads in the near future.




cannot be reliably defined at the species level, although high-confidence
identification of higher taxonomic ranks is possible). Nevertheless,
gains in depth of sampling and cost savings outweigh this
caveat. The US Human Microbiome Project (HMP) has sequenced
more than 10,000 specimens from healthy adults on the 454 platform
by targeting V3 to V5 regions in the 16S rRNA gene and producing,
on average, 7,000 sequences per specimen, which is a vast expansion
on the Sanger method of sequencing analysis. The results of
the HMP, which sampled 18 body sites, provide an in-depth definition
of the human microbiome. Another study that focused on the
effects of the antibiotic ciprofloxacin reported the ‘rare biosphere’ in
the gut. This study documented perturbation of taxa and recovery 
from antibiotic treatment, as well as minor constituents that did not 
recover after antibiotic treatment. Such analyses will be important in 
identifying individuals who are at risk of side effects from antibiotic 
treatment, for example overgrowth of pathogens such as C. difficile 
or life-threatening antibiotic-associated diarrhoea. 

When using 16S rRNA gene sequencing to compare individu- 
als it is not necessary to know which organisms are present, only 
whether the spectra of 16S rRNA gene sequences are similar and 
the degree of difference between samples. Projects that compare 
healthy cohorts and those with disease to determine whether there 
is a difference in the microbiome, or examine the effects of diet, 
antibiotic treatment or environmental factors on the microbiome, 
all focus on detecting differences in communities, rather than iden- 
tifying actual taxa. A loss of sensitivity for organism identification 
can be tolerated, and NGS allows cost-effective deep sampling of 
large cohorts, which is needed to reach statistically significant 
conclusions. The Illumina sequencing platform has been applied 
to metagenomics projects, but because this sequencing platform
currently produces reads of 100 bases (HiSeq system) to 150
bases (MiSeq system), only a single hypervariable region can be
sequenced. However, this further loss of sensitivity does not preclude
the use of the Illumina platform for the comparative projects
already described in this Review. An early application of this platform
was its use in a study of vaginal microbiomes in patients
with HIV, in which patients with conditions such
as vaginosis were compared before and after antibiotic therapy.
As a result of the exceptional increases in numbers of reads and 
the lower cost associated with the Illumina platform, it is becom- 
ing more widely used for 16S rRNA gene-sequence profiling and 
continues the microbiome-analysis trend of deeper sampling at 
lower costs. 
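
The comparative use of 16S rRNA profiles described above, in which samples are compared without naming the organisms, can be sketched as a distance between two abundance profiles. The Review does not specify a metric; the code below assumes Bray-Curtis dissimilarity on relative abundances, and the OTU labels are arbitrary placeholders.

```python
from typing import Dict


def relative_abundance(counts: Dict[str, int]) -> Dict[str, float]:
    """Convert per-OTU sequence counts to proportions that sum to 1."""
    total = sum(counts.values())
    if total <= 0:
        raise ValueError("sample has no sequences")
    return {otu: c / total for otu, c in counts.items()}


def bray_curtis(sample_a: Dict[str, int], sample_b: Dict[str, int]) -> float:
    """Bray-Curtis dissimilarity between two samples (0 = identical, 1 = disjoint).

    For profiles that each sum to 1, this equals 1 minus the shared abundance.
    """
    pa, pb = relative_abundance(sample_a), relative_abundance(sample_b)
    shared = sum(min(pa.get(otu, 0.0), pb.get(otu, 0.0)) for otu in set(pa) | set(pb))
    return 1.0 - shared


# OTU labels are opaque identifiers; no taxonomy is needed for the comparison.
before = {"otu_1": 50, "otu_2": 50}
after = {"otu_1": 100}
print(bray_curtis(before, after))
```

Because only the overlap between profiles matters, the calculation works equally well when an OTU cannot be assigned to a named species, which is the situation with short reads covering a single hypervariable region.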
Shotgun sequencing for cataloguing organisms

Targeted sequencing is a powerful tool for assessing the organisms that 
are present in microbial communities, but it is limited in terms of the 
functional and genetic information produced. Organisms for which 
the genome sequences are known (currently there are several thousand 
sequenced bacterial genomes) can be used to infer the genes and func- 
tional capabilities of the community (Fig. 1). However, many organisms 
have no reference sequence. Furthermore, a reference sequence does not 
completely describe the genes that are contributed by an organism. There 
is considerable variation in the genomes between strains of the same species.
Two strains of Escherichia coli, O157:H7 and K-12, both have 16S
rRNA gene sequences of E. coli, but differ in hundreds of genes. There are 
limits to what can be learned about the genetic content of communities 
from 16S rRNA gene sequences alone. 

Moving beyond this level of functional inference requires a gene-based 
census. This catalogue of genes can be provided by shotgun sequencing 
of DNA that has been extracted from the community as a whole and 
samples the mixture of genomes that make up the metagenome (Fig. 1). 
In a community in excess of hundreds of species with varying abundance,
deep sequencing is needed to sample minor constituents that are not 
necessarily unimportant. The bacterial concentration in the gut can be 
10¹¹ cells ml⁻¹ (refs. 38, 39), so for an organism that is present at a
concentration of 1 per 10⁶ there are 10⁵ cells ml⁻¹, which is sufficient for the
organism's products, such as metabolites and toxins, to have an effect on 
the community and the host. 

Illumina sequencing of faecal samples produced 4 gigabases per sam- 
ple and 10 Gb per sample in the Metagenomics of the Human Intestinal 
Tract (MetaHIT) and HMP projects, respectively, which corresponded
to tens of millions of reads per sample. At this depth of sequencing, the 
genomes of minor constituents such as E. coli (with an abundance of 
about 1% or lower) are sampled almost completely, and organisms with 
an even lower abundance have some of their genome represented. This 
extraordinary sampling of complex microbial communities is made 
possible by producing large amounts of data and by the low cost of 
NGS methods. 
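
The claim that a constituent at about 1% abundance is sampled almost completely can be checked with a back-of-the-envelope sketch. The genome size of roughly 5 megabases for E. coli is an assumption not stated in the text, as is the simplification that abundance equals the organism's share of sequenced bases. The fraction of the genome covered uses the standard Poisson (Lander-Waterman) approximation, 1 − e^(−c), which is also an assumption of this sketch.

```python
import math


def expected_coverage(total_bases: float, abundance: float, genome_size: float) -> float:
    """Mean number of times each base of one organism's genome is sequenced.

    Assumes the organism's share of sequenced bases equals its abundance.
    """
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases * abundance / genome_size


def fraction_covered(coverage: float) -> float:
    """Poisson approximation of the genome fraction hit by at least one read."""
    return 1.0 - math.exp(-coverage)


# 4 Gb per sample (from the text), 1% abundance (from the text),
# and an assumed genome size of about 5 Mb.
cov = expected_coverage(4e9, 0.01, 5e6)
print(round(cov, 1), round(fraction_covered(cov), 4))
```

Under these assumptions the mean coverage is about 8-fold, which leaves only a very small fraction of the genome unsampled. An organism ten times rarer would reach less than 1-fold coverage, so only part of its genome would be represented.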

Shotgun sequence data, in addition to 16S rRNA gene analysis, provide 
information on the organisms that make up communities. Extracting 
16S rRNA gene sequences from shotgun reads to determine the organ- 
isms present is possible; however, targeted 16S rRNA gene sequencing 
tends to introduce biases (owing to the broad-range PCR used to amplify 
16S rRNA gene sequences or the choice of region within the 16S rRNA 
gene), which shotgun sequencing does not. Shotgun sequencing is less 
sensitive than targeted rRNA sequencing because only a small fraction of the
sequences are from 16S rRNA genes. Another approach is to align shotgun
sequences to bacterial reference genomes, allowing the relative
abundance of species to be determined on the basis of the number of 
reads that align to each reference genome (also useful for the comparative 
studies already described). The MetaHIT project has used this approach 
to classify individuals into different groups, called enterotypes, on the 
basis of the community structure in their faecal samples. The same
enterotypes have been found in 16S rRNA gene-based analysis. The
vaginal microbiome has also been classified into five groups. These
observations suggest the human microbiome may exist in distinct states 
in different people, although correlation with environmental, genetic 
or health status is not yet clear. Stratifying future studies depending on 
which community class an individual belongs to may be important for 
identifying correlations with phenotypic data. 
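
The read-counting approach to relative abundance described above can be sketched as follows. The text specifies only that abundance is based on the number of reads aligning to each reference genome; dividing by genome length, so that larger genomes are not over-represented, is an assumption of this sketch, and the species names, counts and lengths are hypothetical.

```python
def relative_abundance(read_counts, genome_lengths):
    """Convert per-genome aligned read counts to relative abundances.

    Counts are divided by genome length before normalizing, so that a
    large genome does not appear more abundant simply because it
    attracts more reads per cell (an assumption of this sketch).
    """
    per_base = {name: read_counts[name] / genome_lengths[name]
                for name in read_counts}
    total = sum(per_base.values())
    if total == 0:
        return {name: 0.0 for name in per_base}
    return {name: value / total for name, value in per_base.items()}

# Hypothetical aligned read counts and reference genome lengths (bases).
counts = {"species_A": 60_000, "species_B": 30_000, "species_C": 10_000}
lengths = {"species_A": 6_000_000, "species_B": 3_000_000, "species_C": 2_000_000}
abundance = relative_abundance(counts, lengths)
```

In this example species_A has twice the reads of species_B but also twice the genome length, so the two are assigned equal abundance.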

Reference genome sequences are clearly needed, both to infer the
genetic content of organisms identified by their 16S rRNA genes and to
identify the sources of shotgun reads by alignment, and so to determine
the organismal content of communities from shotgun data.
NGS techniques have reduced the cost of bacterial sequences to less
than US$1,000 per genome and led to an increase in the production of
‘complete’ genome sequences. Current methodology relies mainly on
Illumina shotgun sequencing and a variety of methods to assemble the
reads into a genome. The product is not a true complete genome, but a
high-quality draft that covers almost all of the genome and results in a
high-quality base sequence. Programmes such as the HMP and the
Genomic Encyclopedia of Bacteria and Archaea (GEBA) are producing
reference genomes by the thousands.

Although bacteria are the main components of the human micro- 
biome, eukaryotic microbes and viruses (both human viruses and 
bacteriophages) are also present (Table 2). The study of eukaryotic
microbes is not as advanced as that of bacteria, but the organisms are
identified by signature sequences (such as fingerprinting and 18S rRNA)
and by shotgun sequencing, as for bacteria. The number of reference
genomes for eukaryotic microbes is smaller than that for bacteria, and 
progress will depend on addressing this shortfall. 

By contrast, considerable effort is being given to characterizing the
genomes of human viruses and bacteriophages, known collectively as the
virome (Box 1). This work is based on shotgun sequencing (Fig. 1), although
oligonucleotide microarrays for virus detection are also used. Viral
sequences can be detected in shotgun data from different body sites, and
viruses can also be enriched by processing samples before DNA extraction.
Virome analysis by shotgun sequencing of microbial communities
(discussed later) has led to the identification of human viruses, as
well as the detection of known viruses in healthy subjects and in diseases
of unknown aetiology. Likewise, bacteriophages are found to be highly
diverse at different body sites, with differences between individuals
as a result of diet or disease states.


Sequencing for gene catalogues and functional inference 
Metagenomic shotgun data also sample community gene content,
which is useful to define community capabilities and identify
particular members. Deep sequencing, such as that used in the
MetaHIT and the HMP, broadly samples the genomes of even minor 
constituents, facilitating the identification of genes present within 
a given community (Fig. 1). By using the sequence reads them- 
selves, or by first assembling them into contigs (Box 1), sequence 
data can be compared with databases such as the National Insti- 
tutes of Health’s GenBank to identify which genes are present. De 
novo prediction of genes from metagenomic data is also possible,
which provides motifs for functional inference even if the sequence 
does not find a match in a database. Finally, alignment of reads or 
contigs to reference genomes identifies which organisms are pre- 
sent, along with their known gene content. These methods convert 
metagenomic sequence data into catalogues of genes that can be 
further analysed. 

Gene catalogues can be compared with databases such as the
Kyoto Encyclopedia of Genes and Genomes (KEGG), which sorts
gene products into pathways and processes. Such analyses provide
lists of pathways, identify which pathway genes are in the community
and quantify the abundances of genes and pathways. Comparing
gene catalogues to specialized metabolic databases, such as the
Carbohydrate-Active Enzymes database, is also useful.
Carbohydrate-degrading capabilities of communities differ between body
sites, suggesting the carbohydrate spectrum of each body site has
determined which organisms and pathways are present.
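
Quantifying pathway abundance from a gene catalogue amounts to summing gene-level counts over a gene-to-pathway mapping. The sketch below assumes such a mapping is already available as a plain dictionary; it does not use any real KEGG interface, and the gene and pathway identifiers are hypothetical placeholders.

```python
from collections import defaultdict

def pathway_abundance(gene_counts, gene_to_pathways):
    """Sum matched-read counts per pathway.

    A gene that belongs to several pathways contributes its full count to
    each of them; genes without an annotation are tallied separately.
    """
    totals = defaultdict(int)
    unassigned = 0
    for gene, count in gene_counts.items():
        pathways = gene_to_pathways.get(gene)
        if not pathways:
            unassigned += count
            continue
        for pathway in pathways:
            totals[pathway] += count
    return dict(totals), unassigned

# Hypothetical gene catalogue (gene -> matched reads) and annotation table.
gene_counts = {"gene_1": 120, "gene_2": 30, "gene_3": 50, "gene_4": 8}
gene_to_pathways = {
    "gene_1": ["pathway_X"],
    "gene_2": ["pathway_X", "pathway_Y"],
    "gene_3": ["pathway_Y"],
}
totals, unassigned = pathway_abundance(gene_counts, gene_to_pathways)
```

Tracking the unassigned count separately keeps visible how much of the catalogue has no pathway annotation.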

In addition to pathway analysis, determining the presence and 
abundance of genes, such as antibiotic-resistance genes or virulence 
factors, in a community is possible using similar methods to those 
already described, and can shed light on pathogen burden in an indi- 
vidual and consequences of antibiotic treatment. The importance 
of functional analyses cannot be overemphasized, and functional 
properties of communities are thought to be more important than 
their taxonomic composition.


Computational tools and strategies 

Metagenomic data are a rich source of information for the sequencing
and analysis methods already discussed. The data analysis
workflow has three phases. In the first phase, primary data are processed
and filtered depending on the application. For 16S rRNA gene
sequencing, the quality of analysis is important so that organisms are
not misclassified. Initial processing addresses read quality, chimaerism
(a read formed from different 16S rRNA genes), read length
after removing low-quality bases and related issues. For shotgun
sequence data, in addition to sequence quality, artefacts such
as duplicate reads must also be addressed, and contaminating
human sequences must be removed computationally. Removal of
human and bacterial sequences is important in read processing for
virome analysis (Fig. 1).
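
Part of this first phase can be illustrated with a minimal filter that trims low-quality bases, discards reads that become too short and removes exact duplicates. The quality and length thresholds and the function names are assumptions made for illustration; chimaera detection and removal of human sequences require reference data and are not shown.

```python
def trim_low_quality(seq, quals, min_q=20):
    """Trim bases from the 3' end until one has quality >= min_q."""
    end = len(seq)
    while end > 0 and quals[end - 1] < min_q:
        end -= 1
    return seq[:end], quals[:end]

def filter_reads(reads, min_q=20, min_len=50):
    """Quality-trim reads, drop short reads, and drop exact duplicates.

    reads is an iterable of (sequence, per-base quality list) pairs.
    """
    seen = set()
    kept = []
    for seq, quals in reads:
        seq, quals = trim_low_quality(seq, quals, min_q)
        if len(seq) < min_len:
            continue  # too short after trimming
        if seq in seen:
            continue  # exact duplicate of a read already kept
        seen.add(seq)
        kept.append((seq, quals))
    return kept

# Toy reads; min_len is lowered so the short examples survive.
toy_reads = [
    ("ACGTAC", [30, 30, 30, 30, 10, 5]),  # trimmed to ACGT
    ("ACGT", [30, 30, 30, 30]),           # duplicate after trimming
    ("AC", [30, 30]),                     # too short
    ("GGGGTT", [30] * 6),                 # kept unchanged
]
filtered = filter_reads(toy_reads, min_q=20, min_len=4)
```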

Table 2 | Characteristics of bacteria, microbial eukaryotes and viruses in the human microbiome

Characteristic | Bacteria | Viruses | Eukaryotic microbes
Genome size | 0.5-10 megabases | 1-1,000 kilobases | 10-50 megabases
Number of taxa in the human microbiome | At least thousands | Unknown, but could be as many as bacteria | Unknown, but may be fewer than bacteria
Relative abundances | Highly variable | Highly variable | Unknown
Targeted detection methods | Sequencing of genes such as 5S and 16S rRNA | No universal method for genes, but virus-specific polymerase chain reaction assays for some | Sequencing of 18S rRNA gene; spacer region in rRNA
Shotgun approach to analyses | Alignment to reference genomes or database comparison | Database comparison | Alignment to reference genomes or database comparison
Subspecies or strain diversity | Modest sequence variation; horizontal gene transfer also contributes | High sequence variation | Unknown


Figure 1 | Data and analysis workflow for microbiome analysis. From a
microbiota sample, DNA, RNA and protein can be extracted, and metadata
and strains of bacteria obtained. Data from DNA can be supplemented
with proteome and transcriptome analysis. During primary analysis,
shotgun techniques can produce reads from DNA, which are then aligned
to reference genomes to identify variants and community population
genetics, assembled into contigs to make gene predictions or compared
with databases. Alternatively, targeted sequencing such as 16S rRNA gene
sequencing can be used to take a community census, and these data are then
compared with databases to create tables of taxa and abundance, or analysed
with software programs to cluster the reads into OTUs to create tables of
abundance. The derivative data are used in secondary analysis for ecological
metrics or competition and symbiosis analysis. In addition, shotgun reads
and comparisons with reference genomes and databases can be used to build
pathways and reconstruct the capabilities of a community. The combination
of these analyses will contribute to understanding the differences within and
between individuals.


Following production of processed reads, the second phase
involves generating various derivative data sets. For 16S rRNA gene
analysis, tables of taxa and abundance are produced by comparisons
with 16S rRNA sequence databases or by using software packages to
cluster the reads into OTUs. Comparing shotgun reads to gene
databases, such as GenBank or KEGG, by using the Basic Local
Alignment Search Tool (BLAST), for example, produces lists of
genes and the number of matched reads. Alignment of reads to
reference genomes produces tables of the breadth and depth of coverage
of each genome by reads. In each of these data sets, there is more
biological information to be gleaned and added through further
analysis. Not all reads match sequences in databases because not all
organisms have a reference genome sequenced. In addition, reads
may match genes whose function has not been elucidated. These
sequences of unknown origin or function can be a sizeable fraction
of the data, and the effect of this uninformative portion on analyses and
conclusions is not clear.
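
The breadth and depth of coverage of a reference genome can be computed directly from the positions of aligned reads. The following is a minimal sketch, assuming alignments are supplied as 0-based, half-open (start, end) intervals that lie within the genome; the function name and the toy values are hypothetical.

```python
def coverage_stats(alignments, genome_length):
    """Return (breadth, mean depth) of coverage for one reference genome.

    breadth is the fraction of positions covered by at least one read;
    mean depth is the average number of reads covering a position.
    Assumes 0 <= start < end <= genome_length for every alignment.
    """
    # Difference array: +1 where a read starts, -1 just past where it ends.
    delta = [0] * (genome_length + 1)
    for start, end in alignments:
        delta[start] += 1
        delta[end] -= 1

    covered = 0
    total_depth = 0
    running = 0
    for pos in range(genome_length):
        running += delta[pos]
        total_depth += running
        if running > 0:
            covered += 1
    return covered / genome_length, total_depth / genome_length

# Two overlapping toy reads on a 10-base genome.
breadth, mean_depth = coverage_stats([(0, 4), (2, 6)], 10)
```

Here six of the ten positions are covered, giving a breadth of 0.6, and the eight aligned bases give a mean depth of 0.8.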

The third phase of analysis uses these derivative data to produce
trees or other representations of the similarity of communities,
abundance curves, biodiversity plots, and other ecological and statistical
descriptors of community structure (Fig. 1). A list of hits
from BLAST is used to build metabolic pathways for reconstruction
of community capabilities. Alignments to reference genomes are
further analysed for variants and population genetics of communities.
Computational analysis can also be used to determine which
organisms co-occur or rarely co-occur as evidence for symbiosis or
competition, respectively, or to follow the dynamics of community
structure in longitudinal time series.
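
Two widely used descriptors of this kind are the Shannon diversity index of a single community and the Bray-Curtis dissimilarity between two communities. The text does not name specific metrics, so these are offered only as representative examples, and the taxon tables are hypothetical.

```python
import math

def shannon_index(counts):
    """Shannon diversity H = -sum(p * ln p) over taxa with non-zero counts."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)

def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two taxon -> count tables.

    0 means identical composition; 1 means no taxa in common.
    """
    taxa = set(a) | set(b)
    numerator = sum(abs(a.get(t, 0) - b.get(t, 0)) for t in taxa)
    denominator = sum(a.get(t, 0) + b.get(t, 0) for t in taxa)
    return numerator / denominator if denominator else 0.0

# Hypothetical tables of taxa and abundance for two samples.
sample_1 = {"taxon_A": 50, "taxon_B": 50}
sample_2 = {"taxon_A": 50, "taxon_C": 50}
diversity = shannon_index(list(sample_1.values()))
dissimilarity = bray_curtis(sample_1, sample_2)
```

A matrix of pairwise dissimilarities computed this way is a typical input for the trees and other representations of community similarity mentioned above.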

Some analyses pose significant computational challenges. Com- 
parisons to gene databases at the protein level are particularly 
demanding because shotgun sequences must be translated into 
polypeptides in all six reading frames, and each must be compared 
with a gene database represented at the protein level. Using con- 
ventional BLASTx programs for this comparison in large data sets, 
such as the HMP, could take decades, so supercomputers, accelerated
BLAST programs or both must be used. A lack of efficient
software and of large enough computer clusters is often the bottleneck
for metagenomic analysis, because sequencing and data production
are not limiting factors. Management of large data sets and computing
resources is receiving more attention, with cloud-computing
services seeming to be a viable alternative.
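
The six-frame translation that makes protein-level comparison so demanding can be sketched in a few lines. This is an illustration using the standard genetic code, not the implementation used by BLASTx; codons containing ambiguous bases are rendered as 'X', and the function names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))

def six_frame_translation(seq):
    """Translate a read in three forward and three reverse-strand frames."""
    seq = seq.upper()
    strands = (seq, reverse_complement(seq))
    return [translate(strand[offset:])
            for strand in strands for offset in range(3)]

frames = six_frame_translation("ATGGCCTAA")
```

Each read therefore yields six candidate polypeptides, every one of which must be searched against the protein database, which is one reason the comparison scales so poorly on large data sets.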


Future directions and challenges 

The rapid rise in metagenomic studies has solved many problems 
but, as the field has grown, other questions have been raised. Exist- 
ing methodology is becoming more sophisticated, and sequencing 
technology is making exponential advances (Table 1). The Illumina
platform introduced instruments that were more appropriate for 
sequencing smaller genomes, with faster run times and longer read 
lengths, offering more flexibility for metagenomic applications. The 
long read length of the PacBio platform has the potential to help 
distinguish the reads from different organisms, which is a challenge 
for metagenomic shotgun sequencing. The technology produced by 
Oxford Nanopore promises long reads and short run times in a scal- 
able system, and is therefore a good match for microbial applications. 
Reducing the amount of DNA needed for shotgun sequencing will 
allow communities in smaller anatomical regions, such as within the 
gastrointestinal tract, to be studied separately rather than together 
with other regions as is the case with the current methodology. Short 
run-time instruments and reductions in sample size will also hasten 
the introduction of microbiome analysis to the clinic, where analy- 
ses of patient samples must be quick and able to deal with limited 
amounts of material. Ultimately, the aim of human-microbiome 
research is its application as a diagnostic, therapeutic and preven- 
tive tool in the clinic. 

The main limitation of using shotgun data is the large number of 
organisms that have not been cultured, let alone sequenced. These 
organisms are therefore under-represented in databases, and their 
shotgun reads are anonymous. When community shotgun data are 
assembled into genomes to obtain genome sequences for new organ- 
isms, contig sizes are typically small as a result of lower organism 
abundance and the challenges associated with assembly of a complex 
mixture. The long read lengths of PacBio and Oxford Nanopore 
instruments should help with these challenges, as will the develop- 
ment of assembly algorithms for metagenomic data. Expanding the 
catalogue of reference genomes by producing reference sequences 
for individual uncultured organisms is an active area. Methods that 
use cell sorting to isolate organisms, coupled with sequencing and 
assembly techniques for single-cell DNA preparations, are producing
new genome sequences and, in high-throughput mode, could
complement shotgun metagenomics for analysing communities. 

One problem associated with genomic data is that they do not
address whether an organism is alive or has succumbed to host
defences or antibiotic treatment. However, the data can be comple- 
mented with transcriptome analysis, or proteomic and metabolomic 
data sets, which analyse gene expression and metabolic data that are 
more likely to be derived specifically from living cells. 

The simultaneous advances in human genetics and genomics
offer opportunities for combining studies of host genotype
with microbiome phenotype. Methods for viewing the microbiome
as a quantitative trait and relating this to host genotype are
being developed. Advances in host-microbiome studies are also
coming from combining immunology and human-microbiome
research. Moreover, continued development of statistical methods
in microbiome research, such as advances in power analysis, will
aid experimental design and future analysis.